Data replication method and storage system

ABSTRACT

A data replication method and a storage system are provided. The method is applied to a storage system including a first storage device and a second storage device. According to the method, after determining replication information, a first storage system determines a first replication sub-information and a second replication sub-information according to the replication information, where the replication information is used to indicate data that needs to be replicated by the first storage system to a second storage system in a current replication task. Then, the first storage device replicates data to the second storage system according to the second replication sub-information, and the second storage device replicates data to the second storage system according to the second replication sub-information. According to the data replication method, efficiency of replication performed between the first storage system and the second storage system can be improved.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of International Application No.PCT/CN2013/089176, filed on Dec. 12, 2013, which is hereby incorporatedby reference in its entirety.

TECHNICAL FIELD

The present disclosure relates to the field of storage technologies, andin particular, to a data replication method and a storage system.

BACKGROUND

To protect security of data, and for an objective of disaster recovery,storage vendors establish a remote disaster recovery center to implementremote data backup, so as to ensure that original data is not lost ordamaged after a disaster (such as a fire and an earthquake) occurs, andto ensure that a key service resumes within an allowed time range,thereby reducing a loss caused by the disaster as much as possible.

Current disaster recovery systems mainly include: a 2 center disasterrecovery system and a 3 data center (3DC) disaster recovery system. Inthe 2 center disaster recovery system, disaster recovery is implementedby establishing two data centers, where an active data center is used toundertake a service of a user, and a standby data center is used to backup data, a configuration, a service, and the like of the active datacenter. When a disaster occurs in the active data center, the standbydata center may take over the service of the active data center. In the3DC disaster recovery system, disaster recovery is implemented by usingstorage systems deployed at the three data centers. Generally, in the3DC disaster recovery system, the three data centers may be respectivelyreferred to as a production site, a level-1 site, and a level-2 site.Generally, the level-1 site and the level-2 site are disaster recoverysites, the production site and the level-1 site may be located in twodifferent locations in a same city, and the level-2 site and theproduction site are located in different cities. In an existing 3DCdisaster recovery system, disaster recovery is generally implementedamong the three data centers by using a synchronous remote replicationtechnology and an asynchronous remote replication technology. Becausereliability and scalability of the 3DC disaster recovery system arerelatively good, a scope of application of the 3DC disaster recoverysystem is wider.

However, in the existing 3DC redundancy system, when data needs to bereplicated to a level-2 site, generally, a production site or a level-1site replicates the data to the level-2 site; therefore, datareplication efficiency is not high.

SUMMARY

A data replication method and a storage system provided by embodimentsof the present disclosure can improve replication efficiency.

According to a first aspect, an embodiment of the present disclosureprovides a data replication method, where the method is applied to astorage system including at least a first storage device and a secondstorage device, and the method includes:

determining, by a first storage system, replication information, wherethe replication information is used to indicate data that needs to bereplicated by the first storage system to a second storage system in acurrent replication task, the first storage device and the secondstorage device that are in the first storage system store same data, andthe first storage system and the second storage system implement databackup by using an asynchronous replication technology;

determining, by the first storage system, first replicationsub-information and second replication sub-information according to thereplication information, where the first replication sub-information isused to indicate data that needs to be replicated by the first storagedevice to the second storage system in the current replication task, thesecond replication sub-information is used to indicate data that needsto be replicated by the second storage device to the second storagesystem in the current replication task, and the data indicated by thefirst replication sub-information is different from the data indicatedby the second replication sub-information;

replicating, by the first storage device to the second storage systemaccording to the first replication sub-information, the data that needsto be replicated by the first storage device to the second storagesystem; and

replicating, by the second storage device to the second storage systemaccording to the second replication sub-information, the data that needsto be replicated by the second storage device to the second storagesystem.

In a first possible implementation manner of the first aspect, thereplicating, by the first storage device to the second storage systemaccording to the first replication sub-information, the data that needsto be replicated by the first storage device to the second storagesystem includes:

replicating, by the first storage device to a destination data volume inthe second storage system according to the first replicationsub-information, the data that needs to be replicated by the firststorage device to the second storage system and that is stored in afirst source data volume; and

replicating, by the second storage device to the destination data volumein the second storage system according to the second replicationsub-information, the data that needs to be replicated by the secondstorage device to the second storage system and that is stored in asecond source data volume, where

data stored in the first source data volume is the same as data storedin the second source data volume.

With reference to the first possible implementation manner of the firstaspect, in a second possible implementation manner of the first aspect,the determining, by a first storage system, replication informationincludes:

sending, by the first storage device, a replication task start messageto the second storage device, where the replication task start messagecarries an identifier of the first source data volume and an identifierof the destination data volume;

determining, by the second storage device, the second source data volumein the second storage device according to the identifier of the firstsource data volume, the identifier of the destination data volume, and apreset replication relationship, where the replication relationshipincludes a correspondence among the first source data volume, the secondsource data volume, and the destination data volume;

determining, by the second storage device, the replication informationaccording to the data stored in the determined second source datavolume; and

determining, by the first storage device, the replication informationaccording to the data stored in the first source data volume.

With reference to the first possible implementation manner of the firstaspect, in a third possible implementation manner, the determining, by afirst storage system, replication information includes:

determining, by the first storage device in the first storage system,the replication information according to the data stored in the firstsource data volume;

sending, by the first storage device, a replication task start messageto the second storage device, where the replication task start messagecarries an identifier of the first source data volume, an identifier ofthe destination data volume, and the replication information; and

determining, by the second storage device according to the identifier ofthe first source data volume, the identifier of the destination datavolume, and a preset replication relationship, the second source datavolume corresponding to the replication information, where thereplication relationship includes a correspondence among the firstsource data volume, the destination data volume, and the second sourcedata volume.

With reference to the first aspect or any one of the first to thirdpossible implementation manners of the first aspect, in a fourthpossible implementation manner, the determining, by the first storagesystem, first replication sub-information and second replicationsub-information according to the replication information includes:determining, by the first storage device, the first replicationsub-information according to the replication information and a presetreplication policy; and determining, by the second storage device, thesecond replication sub-information according to the replicationinformation and the replication policy.

With reference to the first aspect or any one of the first to thirdpossible implementation manners of the first aspect, in a fifth possibleimplementation manner, the determining, by the first storage system,first replication sub-information and second replication sub-informationaccording to the replication information includes: receiving, by thefirst storage device, a replication negotiation request of the secondstorage device, where the replication negotiation request includes atleast bandwidth information of a link between the second storage deviceand the second storage system; determining, by the first storage device,the first replication sub-information and the second replicationsub-information according to the bandwidth information of the link; andsending, by the first storage device, the second replicationsub-information to the second storage device.

With reference to the first aspect or any one of the first to fifthpossible implementation manners of the first aspect, in a sixth possibleimplementation manner, the replication information consists of the firstreplication sub-information and the second replication sub-information,and the method further includes: in a process of executing the currentreplication task, generating, by the first storage device, firstreplication progress information according to data that has beenreplicated; sending, by the first storage device, the first replicationprogress information to the second storage device; when the firststorage device is faulty, determining, by the second storage deviceaccording to the first replication progress information, the secondreplication sub-information, and the replication information, data thathas not been replicated by the first storage device; and replicating, bythe second storage device to the second storage system, the data thathas not been replicated by the first storage device.

According to a second aspect, an embodiment of the present disclosureprovides a data replication method, where the method is applied to astorage system and includes:

receiving, by a second storage system, replication information sent by afirst storage system, where the replication information is used toindicate data that needs to be replicated by the first storage system tothe second storage system in a current replication task, the firststorage system includes at least a first storage device and a secondstorage device, the first storage device and the second storage devicestore same data, and the first storage system and the second storagesystem implement data backup by using an asynchronous replicationtechnology;

sending, by the second storage system, a first acquisition request tothe first storage device according to the replication information, wherethe first acquisition request includes information about data that needsto be acquired by the second storage system from the first storagedevice in the current replication task;

sending, by the second storage system, a second acquisition request tothe second storage device according to the replication information,where the second acquisition request includes information about datathat needs to be acquired by the second storage system from the secondstorage device in the current replication task, and the data requestedby using the first acquisition request is different from the datarequested by using the second acquisition request;

receiving, by the second storage system, data that is sent by the firststorage device according to the first acquisition request; and

receiving, by the second storage system, data that is sent by the secondstorage device according to the second acquisition request.

In a first possible implementation manner of the second aspect, theinformation, which is included in the first acquisition request, aboutthe requested data includes at least an identifier of a first sourcedata volume in the first storage device and an address of the datarequested by using the first acquisition request; and

the information, which is included in the second acquisition request,about the requested data includes at least an identifier of a secondsource data volume in the second storage device and an address of thedata requested by using the second acquisition request, where

both the first source data volume and the second source data volumestore the data that needs to be replicated by the first storage systemto the second storage system in the current replication task, and datastored in the first source data volume is the same as data stored in thesecond source data volume.

With reference to the first possible implementation manner of the secondaspect, in a second possible implementation manner of the second aspect,the receiving, by a second storage system, replication information sentby a first storage system includes:

receiving, by the second storage system, a replication task startmessage sent by the first storage system, where the replication taskstart message carries the identifier of the first source data volume andthe replication information that is determined according to the datastored in the first source data volume; and

the method further includes: determining, by the second storage system,the second source data volume in the second storage device and adestination data volume in the second storage system according to theidentifier of the first source data volume and a preset replicationrelationship, where the replication relationship includes acorrespondence among the first source data volume, the second sourcedata volume, and the destination data volume, and the destination datavolume is configured to store the data received by the second storagesystem in the current replication task.

With reference to the second aspect or the first or second possibleimplementation manner of the second aspect, in a third possibleimplementation manner, the sending, by the second storage system, afirst acquisition request to the first storage device according to thereplication information includes:

determining, by the second storage system according to the replicationinformation and bandwidth of a link between the second storage systemand the first storage device, the data that needs to be acquired fromthe first storage device; and

sending, by the second storage system, the first acquisition request tothe first storage device according to the determined data that needs tobe acquired from the first storage device; and

the sending, by the second storage system, a second acquisition requestto the second storage device according to the replication informationincludes:

determining, by the second storage system according to the replicationinformation and bandwidth of a link between the second storage systemand the second storage device, the data that needs to be acquired fromthe second storage device; and

sending, by the second storage system, the second acquisition request tothe second storage device according to the determined data that needs tobe acquired from the second storage device.

According to a third aspect, an embodiment of the present disclosureprovides a storage system, where the storage system includes at least afirst storage device and a second storage device, and the first storagedevice and the second storage device store same data, where

the storage system is configured to determine replication information,where the replication information is used to indicate data that needs tobe replicated by the storage system to another storage system in acurrent replication task, and the storage system and the another storagesystem implement data backup by using an asynchronous replicationtechnology;

the storage system is further configured to determine first replicationsub-information and a second replication sub-information according tothe replication information, where the first replication sub-informationis used to indicate data that needs to be replicated by the firststorage device to the another storage system in the current replicationtask, the second replication sub-information is used to indicate datathat needs to be replicated by the second storage device to the anotherstorage system in the current replication task, and the data indicatedby the first replication sub-information is different from the dataindicated by the second replication sub-information;

the first storage device is configured to replicate, to the anotherstorage system according to the first replication sub-information, thedata that needs to be replicated by the first storage device to theanother storage system; and

the second storage device is configured to replicate, to the anotherstorage system according to the second replication sub-information, thedata that needs to be replicated by the second storage device to theanother storage system.

In a first possible implementation manner of the third aspect, the firststorage device is configured to replicate, to a destination data volumein the another storage system according to the first replicationsub-information, the data that needs to be replicated by the firststorage device to the another storage system and that is stored in afirst source data volume; and

the second storage device is configured to replicate, to the destinationdata volume in the another storage system according to the secondreplication sub-information, the data that needs to be replicated by thesecond storage device to the another storage system and that is storedin a second source data volume, where

data stored in the first source data volume is the same as data storedin the second source data volume.

With reference to the first possible implementation manner of the thirdaspect, in a second possible implementation manner of the third aspect,the first storage device is further configured to send a replicationtask start message to the second storage device, where the replicationtask start message carries an identifier of the first source data volumeand an identifier of the destination data volume;

the second storage device is further configured to: determine the secondsource data volume in the second storage device according to theidentifier of the first source data volume, the identifier of thedestination data volume, and a preset replication relationship, anddetermine the replication information according to the data stored inthe determined second source data volume, where the replicationrelationship includes a correspondence among the first source datavolume, the destination data volume, and the second source data volume;and

the first storage device is further configured to determine thereplication information according to the data stored in the first sourcedata volume.

With reference to the first possible implementation manner of the thirdaspect, in a third possible implementation manner of the third aspect,the first storage device is further configured to: determine thereplication information according to the data stored in the first sourcedata volume, and send a replication task start message to the secondstorage device, where the replication task start message carries anidentifier of the first source data volume, an identifier of thedestination data volume, and the replication information; and

the second storage device is further configured to determine, accordingto the identifier of the first source data volume, the identifier of thedestination data volume, and a preset replication relationship, thesecond source data volume corresponding to the replication information,where the replication relationship includes a correspondence among thefirst source data volume, the destination data volume, and the secondsource data volume.

With reference to the third aspect or any one of the first to thirdpossible implementation manners of the third aspect, in a fourthpossible implementation manner of the third aspect, the first storagedevice is further configured to determine the first replicationsub-information according to the replication information and a presetreplication policy; and the second storage device is further configuredto determine the second replication sub-information according to thereplication information and the replication policy.

With reference to the third aspect or any one of the first to thirdpossible implementation manners of the third aspect, in a fifth possibleimplementation manner of the third aspect, the first storage device isfurther configured to: receive a replication negotiation request of thesecond storage device, where the replication negotiation requestincludes at least bandwidth information of a link between the secondstorage device and the another storage system; determine the firstreplication sub-information and the second replication sub-informationaccording to the bandwidth information of the link; and send the secondreplication sub-information to the second storage device.

With reference to the third aspect or any one of the first to fifthpossible implementation manners of the third aspect, in a sixth possibleimplementation manner of the third aspect, the replication informationconsists of the first replication sub-information and the secondreplication sub-information; the first storage device is furtherconfigured to: in a process of executing the current replication task,generate first replication progress information according to data thathas been replicated, and send the first replication progress informationto the second storage device; and

the second storage device is further configured to: when the firststorage device is faulty, determine, according to the first replicationprogress information, the second replication sub-information, and thereplication information, data that has not been replicated by the firststorage device, and replicate, to the another storage system, the datathat has not been replicated by the first storage device.

According to a fourth aspect, an embodiment of the present disclosureprovides a storage system, including:

a receiving module, configured to receive replication information sentby another storage system, where the replication information is used toindicate data that needs to be replicated by the another storage systemto the storage system in a current replication task, the another storagesystem includes at least a first storage device and a second storagedevice, the first storage device and the second storage device storesame data, and the storage system and the another storage systemimplement data backup by using an asynchronous replication technology;and

a sending module, configured to send a first acquisition request to thefirst storage device according to the replication information, where thefirst acquisition request includes information about data that needs tobe acquired by the storage system from the first storage device in thecurrent replication task, where

the sending module is further configured to send a second acquisitionrequest to the second storage device according to the replicationinformation, where the second acquisition request includes informationabout data that needs to be acquired by the storage system from thesecond storage device in the current replication task, and the datarequested by using the first acquisition request is different from thedata requested by using the second acquisition request; and

the receiving module is further configured to receive data that is sentby the first storage device according to the first acquisition request,and receive data that is sent by the second storage device according tothe second acquisition request.

In a first possible implementation manner of the fourth aspect, theinformation, which is included in the first acquisition request, aboutthe requested data includes at least an identifier of a first sourcedata volume in the first storage device and an address of the datarequested by using the first acquisition request; and

the information, which is included in the second acquisition request,about the requested data includes at least an identifier of a secondsource data volume in the second storage device and an address of thedata requested by using the second acquisition request, where

both the first source data volume and the second source data volumestore the data that needs to be replicated by the another storage systemto the storage system in the current replication task, and data storedin the first source data volume is the same as data stored in the secondsource data volume.

With reference to the first possible implementation manner of the fourthaspect, in a second possible implementation manner of the fourth aspect,the receiving module is further configured to receive a replication taskstart message sent by the another storage system, where the replicationtask start message carries the identifier of the first source datavolume and the replication information that is determined according tothe data stored in the first source data volume; and

the storage system further includes: a message processing module,configured to determine the second source data volume in the secondstorage device and a destination data volume in the storage systemaccording to the identifier of the first source data volume and a presetreplication relationship, where

the replication relationship includes a correspondence among the firstsource data volume, the second source data volume, and the destinationdata volume, and the destination data volume is configured to store thedata received by the storage system in the current replication task.

With reference to the fourth aspect or either of the first and secondpossible implementation manners of the fourth aspect, in a thirdpossible implementation manner of the fourth aspect, the storage systemfurther includes:

a determining module, configured to: determine, according to thereplication information and bandwidth of a link between the storagesystem and the first storage device, the data that needs to be acquiredfrom the first storage device, and determine, according to thereplication information and bandwidth of a link between the storagesystem and the second storage device, the data that needs to be acquiredfrom the second storage device, where

the sending module is specifically configured to send the firstacquisition request to the first storage device according to the datathat is determined by the determining module and that needs to beacquired from the first storage device, and send the second acquisitionrequest to the second storage device according to the data that isdetermined by the determining module and that needs to be acquired fromthe second storage device.

According to a fifth aspect, an embodiment of the present disclosureprovides still another storage system, including a controller and astorage, where the storage is configured to store data sent by anotherstorage system; and the controller includes:

a communications interface, configured to communicate with the anotherstorage system;

a memory, configured to store a computer executable instruction; and

a processor, configured to run the computer executable instruction, toperform the method according to the foregoing second aspect.

According to a sixth aspect, an embodiment of the present disclosureprovides a computer program product, including a computer readablestorage medium that stores program code, where an instruction includedin the program code is used to perform the method according to theforegoing first aspect.

According to a seventh aspect, an embodiment of the present disclosureprovides a computer program product, including a computer readablestorage medium that stores program code, where an instruction includedin the program code is used to perform the method according to theforegoing second aspect.

In the data replication method provided by the embodiments of thepresent disclosure, because a first storage device and a second storagedevice that are in a first storage system store same data, in a processof replication from the first storage system to a second storage system,the first storage system may determine first replication sub-informationand second replication sub-information according to replicationinformation in a current replication task, the first storage devicereplicates data to the second storage system according to the firstreplication sub-information, and the second storage device replicatesdata to the second storage system according to the second replicationsub-information. According to the method provided by the embodiments ofthe present disclosure, one replication task of the first storage systemcan be shared by the first storage device and the second storage device,so that efficiency of replication performed between the first storagesystem and the second storage system can be improved without increasingproduction costs.

BRIEF DESCRIPTION OF DRAWINGS

To describe the technical solutions in the embodiments of the presentdisclosure more clearly, the following briefly introduces theaccompanying drawings required for describing the embodiments.Apparently, the accompanying drawings in the following description showmerely some embodiments of the present disclosure.

FIG. 1 is a schematic diagram of an application scenario according to anembodiment of the present disclosure;

FIG. 2 is a schematic structural diagram of a storage device accordingto an embodiment of the present disclosure;

FIG. 3 is a flowchart of a data replication method according to anembodiment of the present disclosure;

FIG. 3a is a schematic diagram of a replication bitmap according to anembodiment of the present disclosure;

FIG. 3b is a flowchart of a replication information determining methodaccording to an embodiment of the present disclosure;

FIG. 3c is a flowchart of another replication information determiningmethod according to an embodiment of the present disclosure;

FIG. 4a and FIG. 4b are a signaling diagram of still another datareplication method according to an embodiment of the present disclosure;

FIG. 5 is a flowchart of still another data replication method accordingto an embodiment of the present disclosure;

FIG. 6a and FIG. 6b are a signaling diagram of still another datareplication method according to an embodiment of the present disclosure;and

FIG. 7 is a schematic structural diagram of a storage system accordingto an embodiment of the present disclosure.

DESCRIPTION OF EMBODIMENTS

To make a person skilled in the art understand the solutions in thepresent disclosure better, the following clearly describes the technicalsolutions in the embodiments of the present disclosure with reference tothe accompanying drawings in the embodiments of the present disclosure.Apparently, the described embodiments are merely some but not all of theembodiments of the present disclosure.

A data replication method provided by the embodiments of the presentdisclosure is mainly applied to a disaster recovery system with multipledata centers. The disaster recovery system with multiple data centersthat is described in the embodiments of the present disclosure refers toa disaster recovery system including three or more data centers. Forease of description, the embodiments of the present disclosure aredescribed by using a 3 data center (3DC) disaster recovery system as anexample. FIG. 1 is a schematic diagram of an application scenarioaccording to an embodiment of the present disclosure. A 3DC disasterrecovery system shown in FIG. 1 includes at least one host 100 and threedata centers. The three data centers include at least one productionsite and two disaster recovery sites. For ease of description, in thisembodiment of the present disclosure, the three data centers shown inFIG. 1 are respectively referred to as a production site 11, a level-1site 12, and a level-2 site 13, where the level-1 site 12 and thelevel-2 site 13 are generally the disaster recovery sites. The threedata centers may be connected to each other by using a fiber or anetwork cable in a star-shaped networking manner. The three data centersmay perform data transmission with each other by using the IP (InternetProtocol) protocol or an FC (Fiber Channel) protocol. In this embodimentof the present disclosure, the host 100 may communicate with theproduction site 11 or the level-1 site 12 based on the Small ComputerSystem Interface (SCSI) Protocol or based on the Internet Small ComputerSystem Interface (iSCSI) Protocol, which is not limited herein.

The production site 11 includes a first storage device 110, the level-1site 12 includes a second storage device 120, and the level-2 site 13includes a third storage device 130. The first storage device 110, thesecond storage device 120, and the third storage device 130 may bestorage devices such as a storage array or a server known in currenttechnologies. For example, the first storage device 110, the secondstorage device 120, and the third storage device 130 may include astorage area network (SAN) array, or may include a network attachedstorage (NAS) array. A specific form of a storage device in each datacenter is not limited in this embodiment of the present disclosure. Itshould be noted that all methods in the embodiments of the presentdisclosure are performed by the storage devices in these sites. For easeof description, in this embodiment of the present disclosure, if nototherwise stated, the production site 11 refers to the first storagedevice 110 in the production site 11, the level-1 site 12 refers to thesecond storage device 120 in the level-1 site 12, and the level-2 site13 refers to the third storage device 130 in the level-2 site 13.

In the application scenario shown in FIG. 1, a distance between theproduction site 11 and the level-1 site 12 is relatively short.Generally, the distance between the production site 11 and the level-1site 12 may be less than 100 km. For example, the production site 11 andthe level-1 site 12 may be located in two different locations in a samecity. A distance between the level-2 site 13 and the production site 11or the level-1 site 12 is relatively long. Generally, the distancebetween the level-2 site 13 and the production site 11 may be at least1000 km. For example, the level-2 site 13 and the production site 11 maybe located in different cities. Certainly, in this embodiment of thepresent disclosure, the production site 11 and the level-1 site 12 arenot necessarily limited to be in a same city, as long as synchronousreplication of data between the production site 11 and the level-1 site12 can be implemented.

The host 100 may include any computing device known in currenttechnologies, such as a server, a desktop computer, or an applicationserver. An operating system and another application program areinstalled in the host 100. There may be multiple hosts 100.

In the application scenario shown in FIG. 1, the level-2 site 13 ismainly used as a disaster recovery site, and is configured to back updata. The host 100 does not access the level-2 site 13. Both theproduction site 11 and the level-1 site 12 may receive an access requestfrom the host 100. In a situation, the host 100 may write data only tothe production site 11. For example, one or more hosts 100 may perform adata write operation on the production site 11. In another situation,one host 100 may also write different data separately to the productionsite 11 and the level-1 site 12. In still another situation, differenthosts 100 may perform a data write operation on the production site 11and the level-1 site 12 respectively. For example, a host A performs adata write operation on the production site 11, and a host B alsoperforms a data write operation on the level-1 site 12. It isunderstandable that, in the latter two situations, because theproduction site 11 and the level-1 site 12 may simultaneously undertakea service of the host 100, efficiency for reading and writing data canbe improved. In this embodiment of the present disclosure, whether theproduction site 11 and the level-1 site 12 simultaneously undertake theservice of the host 100 is not limited, as long as synchronousreplication can be implemented between the production site 11 and thelevel-1 site 12 and real-time synchronization of data in the productionsite 11 and the level-1 site 12 can be ensured. It should be noted that,because in the disaster recovery system shown in FIG. 1, both theproduction site 11 and the level-1 site 12 may receive an access requestfrom the host 100, and the production site 11 and the level-1 site 12keep consistency of stored data in a synchronous replication manner,roles of the production site 11 and the level-1 site 12 areinterchangeable.

In this embodiment of the present disclosure, the production site 11 andthe level-1 site 12 may keep, by using a synchronous replicationtechnology, data stored in the production site 11 and data stored in thelevel-1 site 12 synchronized in real time. For example, when the host100 writes data to the production site 11, the production site 11 maysimultaneously back up the data to the level-1 site 12. After the datais written to both the production site 11 and the level-1 site 12, theproduction site 11 returns a write success response to the host 100, soas to keep the data in the production site 11 and the data in thelevel-1 site 12 synchronized. Because the distance between the level-2site 13 and the production site 11 is relatively long, the productionsite 11 or the level-1 site 12 may store the data in the level-2 site 13by using an asynchronous replication technology. For example, when thehost 100 writes data to the production site 11, the production site 11may directly return a write success response to the host 100. After aperiod of time, the production site 11 sends, to the level-2 site 13 forstorage, the data written by the host 100, so as to implement furtherbackup on the data. It should be noted that, in this embodiment of thepresent disclosure, the writing data to the production site 11 may bewriting the data to a cache of the production site 11, or may refer towriting the data to a storage of the production site 11, which is notlimited herein.

It should be noted that, a storage space formed by the storage of theproduction site 11 may include multiple data volumes. The data volume inthis embodiment of the present disclosure is a logical storage spaceformed by mapping a physical storage space. For example, the data volumemay be a logic unit identified by a logical unit number (LUN), or may bea file system. It is understandable that, a storage space of the level-1site 12 or a storage space of the level-2 site 13 may also includemultiple data volumes.

In an actual application, in the 3DC disaster recovery system, data isgenerally kept synchronized between the production site 11 and thelevel-1 site 12 by using the synchronous replication technology, andasynchronous replication is established between the production site 11and the level-2 site 13 or asynchronous replication is establishedbetween the level-1 site 12 and the level-2 site 13. For example, in anasynchronous replication process, asynchronous replication is performedby using a link between the production site 11 and the level-2 site 13,and a link between the level-1 site 12 and the level-2 site 13 is usedas a backup link. For ease of description, in this embodiment of thepresent disclosure, the link between the production site 11 and thelevel-2 site 13 is referred to as a first link, and the link between thelevel-1 site 12 and the level-2 site 13 is referred to as a second link.When an exception occurs in a process of replication between theproduction site 11 and the level-2 site 13, data may be re-replicatedfrom the level-1 site 12 to the level-2 site 13. That the exceptionoccurs in the replication process includes: an exception occurs in thereplication process because a fault occurs on the production site 11, oran exception occurs in the replication process because a fault occurs onthe first link. It is understandable that, in the asynchronousreplication process, asynchronous replication may also be performed byusing the link between the level-1 site 12 and the level-2 site 13, andthe link between the production site 11 and the level-2 site 13 is usedas a backup link.

However, because the data in the production site 11 changes greatly, inthe asynchronous replication process, if asynchronous replication isperformed by using only the first link, bandwidth of the first link maybecome a bottleneck of a data backup process. In an actual application,if a first link with greater bandwidth is used to perform backup, costsof a user may increase. To improve bandwidth for data backup withoutincreasing costs, in this embodiment of the present disclosure, whenasynchronous replication needs to be performed, the production site 11and the level-1 site 12 simultaneously perform asynchronous replicationto the level-2 site 13. In this manner, in the asynchronous replicationprocess, both the first link and the second link are in an active state.

Because this embodiment of the present disclosure mainly involves how toreplicate data from the production site 11 and the level-1 site 12 tothe level-2 site 13, for ease of description, in the followingembodiments, a storage system including the first storage device 110 andthe second storage device 120 that send data in a replication process isreferred to as a first storage system 33, and a storage system to whichthe third storage device 130 that receives data belongs is referred toas a second storage system 44. It is understandable that the firststorage system 33 may further include another storage device, and thesecond storage system 44 may also further include another storagedevice.

A structure of the first storage device 110, the second storage device120, and the third storage device 130 shown in FIG. 1 may be shown inFIG. 2. FIG. 2 is a schematic structural diagram of a storage device 20according to an embodiment of the present disclosure. The storage device20 shown in FIG. 2 is a storage array. As shown in FIG. 2, the storagedevice 20 may include a controller 200 and a disk array 214, where thedisk array 214 herein is configured to provide a storage space, and mayinclude a redundant array of independent disks (RAID) or a disk chassisincluding multiple disks. There may be multiple disk arrays 214, and thedisk array 214 includes multiple disks 216. The disk 216 is configuredto store data. The disk array 214 is in communication connection withthe controller 200 by using a communication protocol, for example, theSCSI Protocol, which is not limited herein.

It is understandable that the disk array 214 is merely an example of astorage in the storage system. In this embodiment of the presentdisclosure, data may also be stored by using a storage, for example, atape library. It should be noted that the disk 216 is also merely anexample of a memory for building the disk array 214. In an actualapplication, there may further be an implementation manner, for example,for building a disk array between cabinets including multiple disks.Therefore, in this embodiment of the present disclosure, the disk array214 may also include a storage including a non-volatile storage mediumsuch as a solid state disk (SSD), a cabinet including multiple disks, ora server, which is not limited herein.

The controller 200 is a “brain” of the storage device 20, and mainlyincludes a processor 202, a cache 204, a memory 206, a communicationsbus (a bus for short) 210, and a communications interface 212. Theprocessor 202, the cache 204, the memory 206, and the communicationsinterface 212 communicate with each other by using the communicationsbus 210. It should be noted that, in this embodiment of the presentdisclosure, there may be one or more controllers 200 in the storagedevice 20. It is understandable that, when the storage device 20includes at least two controllers 200, stability of the storage device20 may be improved.

The communications interface 212 is configured to communicate with thehost 100, the disk 216, or another storage device.

The memory 206 is configured to store a program 208. The memory 206 mayinclude a high-speed RAM memory, or may further include a non-volatilememory, for example, at least one magnetic disk storage. It isunderstandable that the memory 206 may be various non-transitory machinereadable media, such as a random access memory (RAM), a magnetic disk, ahard disk drive, an optical disc, a SSD, or a non-volatile memory, thatcan store program code.

The program 208 may include program code, and the program code includesa computer operation instruction.

The cache 204 is a storage between the controller and the hard diskdrive, and has a capacity smaller than that of the hard disk drive but aspeed faster than that of the hard disk drive. The cache 204 isconfigured to temporarily store data received from the host 100 oranother storage device and temporarily store data read from the disk216, so as to improve performance and reliability of the array. Thecache 204 may be various non-transitory machine readable media, such asa RAM, a ROM, a flash memory, or a SSD, that can store data, which isnot limited herein.

The processor 202 may be a central processing unit CPU or anapplication-specific integrated circuit ASIC (Application-SpecificIntegrated Circuit), or is configured as one or more integrated circuitsthat implement this embodiment of the present disclosure. An operatingsystem and another software program are installed in the processor 202,and different software programs may be considered as differentprocessing module, and have different functions, such as processing aninput/output (I/O) request for the disk 216, performing anotherprocessing on data in the disk 216, or modifying metadata saved in thestorage device 20. Therefore, the controller 200 can implement variousdata management functions, such as an IO operation, a snapshot,mirroring, and replication. In this embodiment of the presentdisclosure, the processor 202 is configured to execute the program 208,and specifically, may perform relevant steps in the following methodembodiments.

It is understandable that, in this embodiment of the present disclosure,hardware structures of the first storage device 110, the second storagedevice 120, and the third storage device 130 may be similar. However,because the first storage device 110, the second storage device 120, andthe third storage device 130 undertake different functions in a datareplication process, a processor 202 in the first storage device 110, aprocessor 202 in the second storage device 120, and a processor 202 inthe third storage device 130 may execute different programs 208. Thefollowing describes in detail how the storage devices in this embodimentof the present disclosure specifically implement a data replicationmethod.

FIG. 3 is a flowchart of a data replication method according to anembodiment of the present disclosure. The method may be applied to theapplication scenario shown in FIG. 1. The method shown in FIG. 3 isdescribed from a perspective of a first storage system 33 that sendsdata. The first storage system 33 may include the first storage device110 in the production site 11 and the second storage device 120 in thelevel-1 site 12 that are shown in FIG. 1. Data stored in the firststorage device 110 is the same as data stored in the second storagedevice 120. A second storage system 44 that receives replicated data mayinclude a third storage device 130. Hardware structures of both thefirst storage device 110 and the second storage device 120 in thisembodiment of the present disclosure may be both shown in FIG. 2.Specifically, the method in FIG. 3 may be jointly performed by aprocessor 202 in the first storage device 110 and a processor 202 in thesecond storage device 120. The following describes the data replicationmethod shown in FIG. 3 with reference to FIG. 1 and FIG. 2. As shown inFIG. 3, the method may include the following steps.

In step 300, the first storage system 33 determines replicationinformation according to stored data, where the replication informationis used to indicate data that needs to be replicated by the firststorage system 33 to the second storage system 44 in a currentreplication task. The replication task (which may also be referred to asa remote asynchronous replication task) refers to that the first storagesystem 33 replicates, to a data volume in the second storage system 44,data carried in a write data command that is received by a data volumein the first storage system 33 in a period of time. In an actualapplication, the first storage system 33 may send all data in the datavolume in the period of time to the second storage system 44, or maysend differential data (which is also referred to as incremental data)that is relative to a last replication task and that is received in theperiod of time to the second storage system 44, which is not limitedherein.

It should be noted that, in a situation, because the first storagedevice 110 or the second storage device 120 may have multiple datavolumes, there may be multiple replication tasks at the same time. Inthis embodiment of the present disclosure, replication task identifiersmay be used to distinguish different replication tasks, where thereplication task identifier may be determined according to an identifierof a source data volume and an identifier of a destination data volumein a current replication process. For example, the replication taskidentifier may be A LUN001 B LUN002, used to represent that thereplication task is to replicate data from a LUN with an identifier of001 in a storage device A to a LUN with an identifier of 002 in astorage device B. In addition, the replication task identifier may alsobe represented by another identifier, which is not limited herein, aslong as the storage devices can identify the source data volume and thedestination data volume that are in the current replication task.

It is understandable that, in this embodiment of the present disclosure,although the first storage device 110 and the second storage device 120keep consistency of stored data in a synchronous replication manner,identifiers of data volumes, in the first storage device 110 and thesecond storage device 120, storing same data are not necessarily thesame. For example, an ID of a LUN, in the first storage device 110,storing data A may be LUN 1#, and an ID of a LUN, in the second storagedevice 120, storing the data A may be LUN 2#. In an actual application,a replication relationship among data volumes in the first storagedevice 110, the second storage device 120, and the third storage device130 may be pre-configured, so that all of the first storage device 110,the second storage device 120, and the third storage device 130 maydetermine a corresponding source data volume and destination data volumein the current replication task according to the replicationrelationship. In other words, data backup may be implemented betweendata volumes having a replication relationship.

A set replication relationship may include a replication relationshipidentifier, or may include identifiers of LUNs in the first storagedevice 110, the second storage device 120, and the third storage device130. For example, the replication relationship identifier may be: ALUN001 B LUN002 C LUN003, used to represent that there is a replicationrelationship among a LUN with an identifier of 001 in a storage deviceA, a LUN with an identifier of 002 in a storage device B, and a

LUN with an identifier of 003 in a storage device C. Certainly, thereplication relationship identifier may also be represented by anotheridentifier, and no limitation is made herein, as long as the storagedevices can identify the replication relationship involved in thecurrent replication task. In an actual application, the replicationrelationship may be saved in advance in all of the first storage device110, the second storage device 120, and the third storage device 130.

In this embodiment of the present disclosure, the replication task is areplication task between LUNs in different storage devices that isstarted according to a preset replication relationship; at a moment,there may be only one replication task for data volumes having areplication relationship. This embodiment of the present disclosuremainly involves a process of an interaction between devices in areplication task. Therefore, in this embodiment of the presentdisclosure, a replication task may be represented by a replicationrelationship identifier. In other words, in this embodiment of thepresent disclosure, the replication task identifier may be the same asthe replication relationship identifier. Certainly, in an actualapplication, to distinguish multiple replication tasks, existing indifferent time periods, of data volumes having a replicationrelationship, the replication task identifier may also be different fromthe replication relationship identifier. For example, a time identifiermay be added to the replication relationship identifier, to form thereplication task identifier, and is used to represent that thereplication task is a replication task, started at a different moment,between data volumes having a replication relationship.

It is understandable that, in a situation, if only one data volume isstored in a storage space of each of the first storage device 110, thesecond storage device 120, and the third storage device 130, the currentreplication task is a replication task started at a current moment for aunique data volume in each storage device. Therefore, the replicationrelationship among the LUNs in the storage devices may also not bepreset.

In this embodiment of the present disclosure, the replicationinformation may be determined according to differential data informationreceived by the first storage device 110 or the second storage device120. The differential data information refers to information about datawritten to a storage device after a previous replication task of thecurrent replication task begins and before the current replication taskbegins. In an actual application, the differential data information maybe recorded by using a differential bitmap. The first storage device 110is used as an example, and the first storage device 110 may create adifferential bitmap for each data volume (for example, a LUN), forrecording information about data written to the data volume. FIG. 3ashows an example of a differential bitmap according to an embodiment ofthe present disclosure. As shown in FIG. 3a , each grid in thedifferential bitmap corresponds to an address in the LUN. In thisembodiment of the present disclosure, a flag bit “1” is used torepresent that a data write occurs. Because a write data command sent bya host 100 may carry data needing to be written and an address of thedata, after receiving the write data command sent by the host 100, thefirst storage device 110 may set a flag bit in a corresponding grid inthe differential bitmap to “1” according to the address of the datacarried in the write data command. Information about changed datawritten to the LUN can be recorded in the manner of performing recordingby using a differential bitmap. It is understandable that, in samereplication duration, if data of a same address is changed continuously,when recording is performed, a flag bit in a grid corresponding to theaddress of the data may always be set to “1”. It is understandable that,in another situation, a flag bit “0” may also be used to represent thata data write occurs. In still another situation, alternatively, “1” maybe used to represent that a data write occurs, and “0” to represent thatno data write occurs. In addition, whether a data write occurs may alsobe represented by using another flag bit, and a specific form of theflag bit is not limited herein. In this embodiment of the presentdisclosure, the differential bitmap may be saved in a cache of the firststorage device 110 or the second storage device 120, or may be saved ina storage of the first storage device 110 or the second storage device120.

It is understandable that, in addition to being recorded in a form ofthe differential bitmap, the differential data information may furtherbe recorded in a tree structure. For example, the differential datainformation may be recorded by using a differential binary tree, adifferential B+ tree, or another tree. When the differential datainformation is recorded by using the tree structure, each leaf node maycorrespond to an address in a LUN. When a data write occurs, a flag bitof the leaf node is set to a corresponding identifier, for representingthat a data write occurs for an address corresponding to the leaf node.In still another situation, the differential data information may alsobe recorded in a structure of a linked list or an entry. When thedifferential data information is represented in a form of a linked list,each entry in the list corresponds to an address in a LUN. In addition,the differential data information may further be recorded in a form of alog and the like, which is not limited herein. Similar to thedifferential bitmap, the foregoing differential data informationrecorded in the form of the tree structure, the linked list, the log,and the like may be saved in the cache of the first storage device 110or the second storage device 120, or may be saved in the storage of thefirst storage device 110 or the second storage device 120.

It is understandable that, when the differential data information isrecorded in the form of the differential bitmap, the replicationinformation may also be represented in a form of a replication bitmap.When the differential data information is recorded in a form of adifferential tree, the replication information may also be representedin a form of a replication tree, and so on, and details are notdescribed herein again.

In this embodiment of the present disclosure, for ease of description, adescription is provided below by using a differential bitmap as anexample, and a flag bit “1” is used to represent that a data writeoccurs. When a replication task is started, in a situation, the firststorage device 110 may convert a differential bitmap when thereplication task is started into a replication bitmap, to determine thereplication information, and re-generate an empty differential bitmap torecord data that is written to the LUN in the first storage device 110after the current replication task is started. The replication bitmap isused to represent data that needs to be replicated to the third storagedevice 130 in the current replication task. In this manner, duringreplication, data of an address corresponding to a grid “1” in thereplication bitmap may be replicated to the third storage device 130. Inanother situation, a differential bitmap and a replication bitmap may beset in the first storage device 110, and the differential bitmap is usedto record information about data written to the first storage deviceafter a previous replication task of the current replication task beginsand before the current replication task begins. When the currentreplication task is started, the differential bitmap may be convertedinto a replication bitmap in the current replication task, and areplication bitmap that is used up in the previous replication task isconverted into a new differential bitmap to record information aboutdata written after the current replication task begins. It should benoted that, when the replication bitmap that is used up in the previousreplication task is converted into the new differential bitmap, flagbits in the replication bitmap in the previous replication task need tobe cleaned, and then a cleaned replication bitmap is used as the newdifferential bitmap. In other words, in this situation, the differentialbitmap and the replication bitmap may be used alternately, and arerespectively used to record the differential data information andinformation about the data to be replicated to the third storage device130. A manner for specifically determining the replication informationaccording to the differential data information is not limited herein.

As described above, in the 3DC disaster recovery system shown in FIG. 1,because the first storage device 110 and the second storage device 120that are in the first storage system 33 keep consistency of stored databy using a synchronous replication technology, when it is determinedthat a replication task between the first storage system 33 and thesecond storage system 44 needs to be started, and in other words, whenit is determined that asynchronous replication to the third storagedevice 130 needs to be started, the first storage system 33 needs todetermine the replication information, for indicating the data thatneeds to be replicated by the first storage system 33 to the secondstorage system 44 in the current replication task. In a situation, thatthe first storage system 33 determines the replication information maybe specifically: the first storage device 110 and the second storagedevice 120 may separately determine the replication informationaccording to the stored data. Because the data stored in the firststorage device 110 is the same as the data stored in the second storagedevice 120, the replication information determined by the first storagedevice 110 according to the stored data is the same as the replicationinformation determined by the second storage device 120 according to thestored data. Specifically, in this embodiment of the present disclosure,a description is provided by using an example in which the currentreplication task is a replication task started among a first source datavolume in the first storage device 110, a second source data volume inthe second storage device 120, and a destination data volume in thethird storage device 130. Data stored in the first source data volume isthe same as data stored in the second source data volume. FIG. 3b is aflowchart of a replication information determining method according toan embodiment of the present disclosure. As shown in FIG. 3b , themethod may include the following steps.

In step 310, the first storage device 110 sends a replication task startmessage to the second storage device 120. The replication task startmessage carries an identifier of the first source data volume in thefirst storage device 110 and an identifier of the destination datavolume in the third storage device 130. The replication task startmessage is used to notify the second storage device 120 that the currentreplication task is a replication task between the first source datavolume in the first storage device 110 and the destination data volumein the third storage device 130.

The replication task start message may be in a pre-defined messageformat. For example, a header of the replication task start message mayinclude a message type (for example, opCode) field, a source device ID(for example, srcAppld) field, and a destination device ID (for example,dstAppld) field. The message type field is used to represent that a typeof the message is a replication task start message. The source device IDfield is used to identify an initiator of the replication task startmessage, and the destination device ID field is used to identify areceiver of the replication task start message. Both the source deviceID field and the destination device ID field may be identified by usingan IP address. A format of a content part (for example, a data field ofthe replication task start message) of the replication task startmessage may be shown as follows:

Byte 0 1 2 3 0 LUN ID 4 ReplicationObject LUN id

where LUN ID may be located in the zeroth to the third bytes, and isused to represent an identifier of a source data volume in the currentreplication task, where the LUN identifier is a unique identifier; and

ReplicationObject LUN id may be located in the seventh to the eighthbytes, is used to represent an identifier of a destination data volumein a destination device that receives data in the current replicationtask, and may be, for example, a LUN ID of the destination device; inother words, the field is used to represent a destination data volume ina destination storage device to which the current replication taskpoints to.

For example, in this step, in the replication task start message sent bythe first storage device 110 to the third storage device 130, the “LUNID” field may be an ID, for example, LUN001, of the first source datavolume, in the first storage device 110, storing to-be-replicated data,and the “ReplicationObject LUN id” field may be an ID, for example,LUN003, of the destination data volume in the third storage device 130,so as to indicate that the current replication task is a replicationtask between a LUN with an identifier of 001 in the first storage device110 and a LUN with an identifier of 003 in the third storage device 130.In still another situation, the “LUN ID” field may also be an ID of thesecond source data volume in the second storage device 120. In thiscase, the first storage device 110 may determine an identifier of thesecond source data volume in advance according to a preset replicationrelationship, the identifier of the first source data volume, and theidentifier of the destination data volume.

In step 312, the second storage device 120 determines the second sourcedata volume in the second storage device according to the identifier ofthe first source data volume, the identifier of the destination datavolume, and a preset replication relationship. The replicationrelationship includes a correspondence among the first source datavolume, the second source data volume, and the destination data volume.As described above, in this embodiment of the present disclosure, thereplication relationship among the data volumes is preset in each of thefirst storage device 110, the second storage device 120, and the thirdstorage device 130. After the second storage device 120 receives thereplication task start message of the first storage device 110, thesecond storage device 120 may determine the second source data volume inthe second storage device 120 according to the preset replicationrelationship, and the identifier of the first source data volume and theidentifier of the destination data volume that are carried in thereplication task start message.

In step 314, the second storage device 120 determines the replicationinformation according to the data stored in the determined second sourcedata volume. In step 316, the first storage device 110 determines thereplication information according to the data stored in the first sourcedata volume. For example, the second storage device 120 may generate areplication bitmap according to a differential bitmap of the secondsource data volume, and the first storage device 110 may generate areplication bitmap according to a differential bitmap of the firstsource data volume. For how the first storage device 110 and the secondstorage device 120 specifically determine the replication informationaccording to the data stored in the data volumes, reference may be madeto the foregoing description, and details are not described hereinagain. Because the data stored in the second source data volume is thesame as the data stored in the first source data volume, the replicationinformation determined by the second storage device 120 according to thesecond source data volume is the same as the replication informationdetermined by the first storage device 110 according to the first sourcedata volume, and the replication information is the replicationinformation determined by the first storage system 33 in the currentreplication task, and is used to indicate the data that needs to bereplicated by the first storage system 33 to the second storage system44 in the current replication task.

In another situation, that the first storage system 33 determines thereplication information may also be specifically: the first storagedevice 110 in the first storage system 33 determines the replicationinformation according to the data stored in the first storage device110, and then sends the determined replication information to the secondstorage device 120. In this manner, the first storage device 110 and thesecond storage device 120 each can have the same replicationinformation. Specifically, as shown in FIG. 3c , FIG. 3c is a flowchartof still another replication information determining method according toan embodiment of the present disclosure. The replication informationdetermining method shown in FIG. 3c is described still by using anexample in which the current replication task is a replication taskstarted among the first source data volume in the first storage device110, the second source data volume in the second storage device 120, andthe destination data volume in the third storage device 130. Data storedin the first source data volume is the same as data stored in the secondsource data volume. The method may include the following steps.

In step 320, the first storage device 110 determines the replicationinformation according to the data stored in the first source datavolume. Specifically, when the replication task needs to be started, thefirst storage device 110 may determine the replication informationaccording to a differential bitmap of the first source data volume,where the replication information is used to indicate the data, whichneeds to be replicated to the second storage system 44, in the currentreplication task. For a specific manner in which the first storagedevice 110 determines the replication information, reference may be madeto the foregoing description, and details are not described hereinagain.

In step 322, the first storage device 110 sends a replication task startmessage to the second storage device 120. The replication task startmessage may carry an identifier of the first source data volume, anidentifier of the destination data volume, and the replicationinformation. The replication task start message is used to notify thesecond storage device 120 that the current replication task is areplication task between the first source data volume in the firststorage device 110 and the destination data volume in the third storagedevice 130. Because the first storage device 110 already determines thereplication information in the current replication task according to thefirst source data volume in step 320, the first storage device 110 maysend the replication information to the second storage device 120 at thesame time when instructing the second storage device 120 to start thereplication task. Therefore, the second storage device 120 may notdetermine the replication information by itself. It is understandablethat, in a situation in which the second storage device 120 does notrecord differential data, the first storage device 110 may also send thereplication information in the current replication task to the secondstorage device 120. It is understandable that, in an actual application,in addition to carrying the replication information, the replicationtask start message may further carry the identifier of the first sourcedata volume and the identifier of the destination data volume, forindicating that the current replication task is a replication taskbetween the first source data volume and the destination data volume.For a specific description of the replication task start message,reference may be made to the foregoing description, and specifically,the replication information may further be carried in another field ofthe data part of the foregoing replication task start message. Forexample, the replication information (for example, address informationof the to-be-replicated data) may be carried in a field after the eighthbyte, and details are not described herein again.

In step 324, the second storage device 120 determines, according to theidentifier of the first source data volume, the identifier of thedestination data volume, and a preset replication relationship, thesecond source data volume corresponding to the replication information.The replication relationship includes a correspondence among the firstsource data volume, the destination data volume, and the second sourcedata volume. Specifically, after receiving the replication task startmessage of the first storage device 110, the second storage device 120may determine, according to the preset replication relationship and theidentifier of the first source data volume and the identifier of thedestination data volume that are carried in the replication task startmessage, the second source data volume in the second storage device 120that stores the same data as the first source data volume, so that thesecond source data volume corresponding to the replication informationcan be determined.

FIG. 3b and FIG. 3c show two implementation manners in which the firststorage system 33 determines the replication information according tothis embodiment of the present disclosure. In an actual application, thereplication information determined by the first storage system 33 in thecurrent replication task may further be determined in another manneraccording to specific network deployment, and a specific manner in whichthe first storage system 33 determines the replication information isnot limited herein. It should be noted that, in this embodiment of thepresent disclosure, because the replication task is a replication taskexecuted between the data volumes, all the replication information inthis embodiment of the present disclosure refers to replicationinformation of a data volume in a storage device. Generally, one datavolume corresponds to one piece of replication information in areplication task.

In step 302, the first storage system 33 determines first replicationsub-information and second replication sub-information according to thereplication information. The first replication sub-information is usedto indicate data that needs to be replicated by the first storage device110 to the third storage device 130, and the second replicationsub-information is used to indicate data that needs to be replicated bythe second storage device 120 to the third storage device 130. It shouldbe noted that, when the first replication sub-information and the secondreplication sub-information are being determined, data ranges indicatedby the first replication sub-information and the second replicationsub-information need to be determined, so as to prevent data replicatedby the first storage device 110 and the second storage device 120 frombeing repeated.

In an actual application, in a situation, the first replicationsub-information and the second replication sub-information may berespectively determined by the first storage device 110 and the secondstorage device 120 that are in the first storage system 33.Specifically, the first storage device 110 may determine the firstreplication sub-information according to the replication informationdetermined in step 300 and a preset replication policy, and the secondstorage device 120 may determine the second replication sub-informationaccording to the replication information determined in step 300 and thepreset replication policy. In an actual application, a replicationpolicy may be preset in the first storage device 110, where thereplication policy may include a replication ratio, a replication range,and the like. For example, in a situation, the set policy may be: thefirst storage device 110 performs replication in a direction from aheader of a replication bitmap to a tail, the second storage device 120performs replication in a direction from the tail of the replicationbitmap to the header, and replication ends when replication ranges areoverlapped. In another situation, the set policy may be: the firststorage device 110 replicates 60% differential data in a direction froma header of a replication bitmap to a tail, and the second storagedevice replicates 40% differential data in a direction from the tail ofthe replication bitmap to the header. In still another situation, theset policy may further be: the first storage device 110 and the secondstorage device 120 replicate differential data separately from themiddle of a replication bitmap to two ends. In addition, the set policymay further include an address range of the data that needs to bereplicated by the first storage device 110, an address range of the datathat needs to be replicated by the second storage device 120, and thelike. In an actual application, the replication policy may be setaccording to a specific case, and the specific replication policy is notlimited herein.

It should be noted that, because in this embodiment of the presentdisclosure, when asynchronous replication is started, a currentreplication task needs to be shared by the first storage device 110 andthe second storage device 120, a same replication policy needs to be setin the first storage device 110 and the second storage device 120, so asto prevent the first storage device 110 and the second storage device120 from replicating same data to the third storage device 130.

In still another situation, to enable the first storage device 110 andthe second storage device 120 to implement load balance in thereplication process, the first storage device 110 may negotiate with thesecond storage device 120 according to the replication information, todetermine the first replication sub-information and the secondreplication sub-information. Specifically, as shown in FIG. 1, in anegotiation process, the first storage device 110 may receive anegotiation request of the second storage device 120, where thenegotiation request includes at least bandwidth information of a secondlink. The first storage device 110 may determine the first replicationsub-information and the second replication sub-information according tothe bandwidth information of the second link. For example, if bandwidthof the second link is greater than bandwidth of a first link, the firststorage device 110 may determine that an amount of data indicated by thefirst replication sub-information is less than an amount of dataindicated by the second replication sub-information. For example, thedata indicated by the first replication sub-information is 30% of atotal amount of data that needs to be replicated, and the data indicatedby the second replication sub-information is 70% of the total amount ofdata that needs to be replicated. After the first storage device 110determines the first replication sub-information and the secondreplication sub-information, the first storage device 110 may send thedetermined second replication sub-information to the second storagedevice 120. The second replication sub-information may be sent to thesecond storage device 120 in a form of a negotiation response message,and an address range of the data indicated by the second replicationsub-information needs to be carried in the negotiation response message.

It should be noted that, in this embodiment of the present disclosure,both the first replication sub-information and the second replicationsub-information are a part of the replication information.Alternatively, in other words, both the data indicated by the firstreplication sub-information and the data indicated by the secondreplication sub-information are a part of data indicated by thereplication information. It is understandable that, if the first storagesystem 33 includes only two storage devices, the replication informationmay consist of the first replication sub-information and the secondreplication sub-information. In this embodiment of the presentdisclosure, when the replication information is represented in a mannerof a replication bitmap, the first replication sub-information or thesecond replication sub-information does not need to be represented byusing an additional separate replication bitmap, and a range of thefirst replication sub-information or the second replicationsub-information may be identified on the replication bitmap. In thisembodiment of the present disclosure, a specific form of the firstreplication sub-information and the second replication sub-informationis not limited, as long as a range of data that needs to be replicatedcan be identified by the first replication sub-information and thesecond replication sub-information.

It is understandable that this embodiment of the present disclosure isdescribed only by assuming that the first storage system 33 includes twostorage devices. In an actual application, if the first storage system33 includes N storage devices, when asynchronous replication is started,the replication information in the current replication task may bedivided into N parts. In other words, N pieces of replicationsub-information may be determined according to the replicationinformation in the current replication task, and the N devices performreplication simultaneously to a third device respectively according tothe N pieces of replication sub-information, where N is a natural numbergreater than 2. When the replication information is represented in amanner of a replication bitmap, the first replication sub-informationand the second replication sub-information may also be represented in amanner of a replication bitmap.

In step 304, the first storage device 110 replicates, to the thirdstorage device 130 according to the first replication sub-information,the data that needs to be replicated by the first storage device 110 tothe third storage device 130. For example, when the replicationinformation is represented in a manner of a replication bitmap, thefirst storage device 110 may replicate, to the destination data volumein the third storage device 130 according to the first replicationsub-information, data in the first source data volume corresponding to aposition with a flag bit of “1” in the replication bitmap. Specifically,the first storage device 110 may send, to the destination data volume inthe third storage device 130 by using a write data command or areplication command, data that needs to be replicated by the firstsource data volume in the first storage device 110 to the third storagedevice 130.

As shown in FIG. 1, because the first storage device 110 and the thirdstorage device 130 implement data backup by using an asynchronousreplication technology, compared with the data stored in the firststorage device 110, there is a particular time delay for the data storedin the third storage device 130. Generally, the first storage device 110receives multiple write data commands of the host 100 in a period oftime, and when the first storage device 110 performs remote asynchronousreplication to the third storage device 130, the host 100 may still senda write data command to the first storage device 110. Therefore, whenthe current replication task is being executed, it is necessary todistinguish, from new data received by the first storage device 110,data that is sent by the first storage device 110 to the third storagedevice 130 in the current replication task.

In this embodiment of the present disclosure, the data that is sent bythe first storage device 110 to the third storage device 130 in thereplication process and the new data received by the first storagedevice 110 in the replication process may be distinguished by using asnapshot technology. A snapshot is a fully usable copy of a specifiedcollection of data, where the copy includes an image of correspondingdata at a time point (a time point at which the copy begins). Thesnapshot may be a duplicate of the data represented by the snapshot, ora replica of the data. In this embodiment of the present disclosure,when replication is started, a state view may be created for a datavolume at a creation moment, only data at the creation moment of thedata volume can be seen by using the view, and modification (new data iswritten) to the data volume after the time point is not reflected in thesnapshot view. Data replication may be performed by using the snapshotview. For the first storage device 110, because snapshot data is“static”, the first storage device 110 may replicate snapshot data tothe third storage device 130 after creating a snapshot for data at eachtime point. In this manner, remote data replication may be complete, andthat the first storage device 110 continues to receive the write datacommand sent by the host 100 during replication is not affected.Therefore, the first storage device 110 may perform snapshot processingon the data in the first source data volume at a moment of startingreplication to form a data duplicate of the first source data volume atthe moment, and send the data duplicate to the destination data volumein the third storage device 130. It should be noted that the dataduplicate is the data to be replicated to the third storage device 130in the current replication task.

Optionally, in this embodiment of the present disclosure, the foregoingproblem may also be resolved in a manner of adding a time slice numberto each write data command received by the first storage device 110. Forexample, the first storage device 110 may include a current time slicenumber manager, where the current time slice number manager saves acurrent time slice number. The current time slice number may berepresented by a value, such as 0, 1, or 2; the current time slicenumber may also be represented by a letter, such as a, b, or c, which isnot limited herein. When the first storage device 110 receives a writedata command, a first number assigned by the current time slice numberis added to data and an address of the data that are carried in thewrite data command. When a remote asynchronous replication task istriggered, the data corresponding to the first number is used as data tobe replicated to the third storage device 130 in the current replicationtask, and the data corresponding to the first number and the address ofthe data are sent to the third storage device 130. The current timeslice number is also modified, so as to identify a subsequent write datacommand. In addition, in this embodiment of the present disclosure, thedata that needs to be sent by the first storage device 110 to the thirdstorage device 130 in the current replication task and the new datareceived by the first storage device 110 may further be distinguished inanother manner, which is not limited herein.

In this embodiment of the present disclosure, an example in which a datavolume included in a storage space of the first storage device 110 is aLUN is used. When a replication task is started, for example, when thereplication information is being determined, the first storage device110 may create a duplicate of a first source LUN at a current moment bymeans of snapshot. Then, in step 304, the first storage device 110 mayreplicate data, in the created duplicate of the first source LUN, of anaddress corresponding to a grid with a flag bit of “1” in the firstreplication sub-information to a destination LUN of the third storagedevice 130. A person skilled in the art may understand that, in this LUNduplicate creation manner, even if new data is written again to thefirst storage device 110 in the same address of the first source LUN inthe current replication task, the data needing to be replicated in thecurrent replication task is not affected.

In this step, an example in which the first storage device 110replicates the data to the third storage device 130 by using a writedata command is used. When the first storage device 110 replicates thedata to the destination LUN of the third storage device 130 according tothe created duplicate of the first source LUN and the first replicationsub-information, the write data command sent by the first storage device110 to the third storage device 130 needs to include an identifier ofthe destination LUN, the to-be-written data, and an address of theto-be-written data that are in the current replication task. Theidentifier of the destination LUN may be a LUN ID. The address of theto-be-written data may be an LBA of the data, for representing adestination address of the data. The third storage device 130 may writethe data carried in the write data command to the destination LUN of thethird storage device 130 according to the identifier of the destinationLUN and the LBA of the to-be-written data. It is understandable that,the write data command sent by the first storage device 110 to the thirdstorage device 130 may further carry an ID of the first source LUN ofthe to-be-written data, for identifying that the data is data sent fromthe first source LUN of the first storage device 110.

In step 306, the second storage device 120 replicates, to the thirdstorage device 130 according to the second replication sub-information,the data that needs to be replicated by the second storage device 120 tothe third storage device 130. Because step 306 is similar to step 304,reference may be made to a relevant description of step 304. It shouldbe noted that, in this embodiment of the present disclosure, there is nosequence for step 304 and step 306, and the replication process for thefirst storage device 110 and the replication process for the secondstorage device 120 may be performed at the same time.

In this embodiment of the present disclosure, the first storage device110 and the second storage device 120 keep consistency of stored data ina synchronous replication manner, and the replication information in thecurrent replication task is determined according to differential datainformation written to a data volume. For example, the replicationinformation in the current replication task is determined according todifferential data information written to the first source data volume inthe first storage device 110. The data stored in the first source datavolume in the first storage device 110 is the same as the data stored inthe second source data volume in the second storage device 120, and thefirst replication sub-information and the second replicationsub-information are determined according to the same replicationinformation, the first source data volume also stores data the same asthe data indicated by the second replication sub-information, and thesecond source data volume also stores data the same as the dataindicated by the first replication sub-information.

In the method shown in FIG. 3, because a first storage device 110 and asecond storage device 120 that are in a first storage system 33 storesame data, the first storage device 110 and the second storage device120 may have same replication information. In an asynchronousreplication process, the first storage system 33 determines firstreplication sub-information and second replication sub-information byusing the replication information, the first storage device 110replicates data to a third storage device 130 according to the firstreplication sub-information, and the second storage device 120replicates data to the third storage device 130 according to the secondreplication sub-information. According to the method provided by thisembodiment of the present disclosure, a same replication task of thefirst storage system 33 can be shared by the first storage device 110and the second storage device 120 that are in the first storage system33. Compared with the prior art in which only one link is used forreplication, in the method shown in FIG. 3, bandwidth of a replicationlink between the first storage system 33 and a second storage system 44can be improved without increasing production costs, thereby improvingreplication efficiency. For example, if bandwidth of both a first linkand a second link shown in FIG. 1 is 10 M, according to the method inthis embodiment of the present disclosure, when data in a LUN in thefirst storage system 33 is replicated to the third storage device 130,bandwidth of a replication link may reach 20 M. However, according tothe method in the prior art in which only one link is used forreplication, bandwidth of a replication link is merely 10 M.

In addition, compared with the prior art in which only one storagedevice in the first storage system 33 is used for replication, accordingto the method in the embodiment shown in FIG. 3, when remoteasynchronous replication is started, multiple storage devices in thefirst storage system 33 are used at the same time, so that resourceconsumption for a single storage device in the first storage system 33can be reduced.

FIG. 4a and FIG. 4b are a signaling diagram of a data replication methodaccording to an embodiment of the present disclosure. In the methodshown in FIG. 4a and FIG. 4b , the method shown in FIG. 3 is furtherdescribed in detail from perspectives of a first storage system 33 and asecond storage system 44. It is understandable that specific structuresof a first storage device 110, a second storage device 120, and a thirdstorage device 130 in the method shown in FIG. 4a and FIG. 4b may stillbe shown in FIG. 2. Specifically, the method shown in FIG. 4a and FIG.4b may be performed separately by a processor in the first storagedevice 110, a processor in the second storage device 120, and aprocessor in the third storage device 130. For ease of description, inthis embodiment of the present disclosure, replication information isrepresented by a replication bitmap, and differential data informationis represented by a differential bitmap. In this embodiment of thepresent disclosure, a description is provided still by using an examplein which a current replication task is a replication task among a firstsource data volume in the first storage device 110, a second source datavolume in the second storage device 120, and a destination data volumein the third storage device 130. The following describes the methodshown in FIG. 4a and FIG. 4b with reference to FIG. 1 and FIG. 3.Specifically, as shown in FIG. 4a and FIG. 4b , the method includes:

In step 402, the first storage device 110 determines to start anasynchronous replication task. In an actual application, the firststorage device 110 may determine, according to a set timer, whether tostart an asynchronous replication task; when the timer reaches a setasynchronous replication time, the first storage device 110 determinesto start the asynchronous replication task. It is understandable that,that the first storage device 110 starts asynchronous replication bysetting a timer is merely one implementation manner. In an actualapplication, the first storage device 110 may further determine,according to an amount of received data, whether to start theasynchronous replication task. When an amount of data received by thefirst storage device 110 is relatively small, the first storage device110 may determine to start the asynchronous replication task.

In step 404, the first storage device 110 stops receiving a write datacommand of a host 100, and processes a received write data command. Inan actual application, that the host 100 writes data to the firststorage device 110 or the host 100 reads data from the first storagedevice 110 is implemented by sending an input/output (Input/output, I/O)command to the first storage device 110. In this embodiment of thepresent disclosure, when the first storage device 110 determines tostart an asynchronous replication process to the third storage device,the first storage device may stop receiving the write data command ofthe host, so that the first storage device 110 may determine, accordingto data received before reception of the write data command of the host100 is stopped, data that needs to be replicated by the first sourcedata volume in the current replication task.

In this step, in a case in which the first storage device 110 and thesecond storage device 120 keep data consistency by means of synchronousreplication, processing the received write data command includes writingdata to the first storage device 110 and the second storage device 120according to the received write data command. Specifically, afterreceiving the write data command sent by the host 100, the first storagedevice 110 may first temporarily store data carried in the write datacommand, and then write the data in a cache to a data volume accordingto a set write policy. At the same time, after learning such a change, adata synchronization engine in the first storage device 110 mayimmediately send a changed data block from the cache to a cache of thesecond storage device 120 directly by using a SAN switch. Afterreceiving the data block, the second storage device 120 may send a writesuccess response to the first storage device 110. After receiving theresponse of the second storage device 120, the first storage device 110may return a write success response to the host 100, to notify the host100 that processing of the data in the write data command is completed.In this manner, the data that is written by the host 100 to the firstsource data volume in the first storage device 110 may be synchronouslywritten to the second source data volume in the second storage device120. Similarly, data that is written by the host 100 to the secondsource data volume in the second storage device 120 may also besynchronously written to the first source data volume in the firststorage device 110 in a similar manner. The first storage device 110 andthe second storage device 120 keep consistency of stored data in theforegoing synchronous replication manner.

In step 406, the first storage device 110 instructs the second storagedevice 120 to start asynchronous replication. In this embodiment of thepresent disclosure, the first storage device 110 and the second storagedevice 120 keep data stored in the first storage device 110 and datastored in the second storage device 120 the same in the synchronousreplication manner. To fully use bandwidth of a link between the firststorage device 110 and the third storage device 130 and of a linkbetween the second storage device 120 and the third storage device 130,and improve replication efficiency, in a replication process of thisembodiment of the present disclosure, the first storage device 110 andthe second storage device 120 may jointly complete a same replicationtask. Therefore, when the first storage device 110 determines to startasynchronous replication, it is required to instruct the second storagedevice 120 to start the asynchronous replication process.

In this step, the first storage device 110 may send a replication taskstart message to the second storage device 120. Specifically, in a firstsituation, the first storage device 110 may notify the second storagedevice 120 of a replication relationship involved in the currentreplication task, so that the second storage device 120 can determine,according to a preset replication relationship, an identifier of thedata volume that is in the second storage device 120 and that isinvolved in the current replication task. For example, the first storagedevice 110 may notify the second storage device 120 of a replicationrelationship identifier by using the replication task start message. Ina second situation, the first storage device 110 may add an identifierof the first source data volume and an identifier of the destinationdata volume to a replication task start message, so that the secondstorage device 120 may determine, according to a preset replicationrelationship, the identifier of the first source data volume, and theidentifier of the destination data volume, the second source data volumethat is in the second storage device 120 and that is involved in thecurrent replication task. In a third situation, after determining thereplication task, the first storage device 110 may determine anidentifier of the second source data volume according to a presetreplication relationship in the first storage device 110, an identifierof the first source data volume, and an identifier of the destinationdata volume, and add the identifier of the second source data volume toa replication task start message that is sent to the second storagedevice 120, so that the second storage device 120 can directly determinethe second source data volume in the second storage device 120 accordingto the received replication task start message. In a fourth situation,the first storage device 110 may add an identifier of the first sourcedata volume to a replication task start message, so that the secondstorage device 120 may determine, according to the identifier of thefirst source data volume and a preset replication relationship, thesecond source data volume that is in the second storage device 120 andthat is involved in the current replication task. In a fifth situation,the first storage device 110 may further add an identifier of thedestination data volume to a replication task start message, so that thesecond storage device 120 may determine, according to the identifier ofthe destination data volume and a preset replication relationship, thesecond source data volume that is in the second storage device 120 andthat is involved in the current replication task. A manner of how thesecond storage device 120 specifically determines the second source datavolume by using the replication task start message is not limited inthis embodiment of the present disclosure, as long as the second storagedevice 120 can determine the second source data volume that stores samedata as the first source data volume. For a specific description on thereplication task start message, reference may be made to a relevantdescription in the embodiment shown in FIG. 3, and details are notdescribed herein again.

In step 408, the second storage device 120 stops receiving a write datacommand of the host 100, and processes a received write data command.Specifically, after receiving a notification, from the first storagedevice 110, about starting asynchronous replication, the second storagedevice 120 may stop receiving the write data command of the host 100.Because step 408 is similar to step 404, for a specific description,reference may be made to step 404.

In step 410, the second storage device 120 creates a duplicate of thesecond source data volume at a current moment. In an actual application,the second storage device 120 may create the duplicate of the secondsource data volume at the current moment by means of snapshot. In thismanner, data that needs to be replicated by the second storage device120 in the current replication task may be determined, and the data thatneeds to be replicated in the current replication task and data that isnewly received by the second storage device 120 in the currentreplication task may be distinguished by using the duplicate of thesecond source data volume. For how to specifically create the duplicateof the second source data volume by using a snapshot technology,reference may be made to the foregoing description, and details are notdescribed herein again.

In step 412, the second storage device 120 generates a secondreplication bitmap according to a differential bitmap of the secondsource data volume. In this embodiment of the present disclosure, afterreceiving the write data command, which is sent by the host 100, forwriting data to the second source data volume, the second storage device120 updates the differential bitmap of the second source data volumeaccording to an address of data carried in the write data command. Forexample, the second storage device 120 may set a flag bit in a grid,corresponding to the address of the data, in the differential bitmap to“1”, for identifying that a data write occurs for the addresscorresponding to the grid. The differential bitmap of the second storagedevice 120 is used to record information about data written to thesecond source data volume in the second storage device 120 after aprevious replication task of the current replication task begins andbefore the current replication task begins. Because this step is similarto step 301, for how to specifically generate the second replicationbitmap according to the differential bitmap of the second source datavolume in the second storage device 120, reference may be made to adescription on step 301.

In step 414, the second storage device 120 notifies the first storagedevice 110 that the second storage device 120 is ready for asynchronousreplication. In an actual application, in order to make both the firststorage device 110 and the second storage device 120 ready forasynchronous replication when replication begins, for example, both thefirst storage device 110 and the second storage device 120 have prepareda replication bitmap and a duplicate of the data volume, after thesecond storage device 120 generates the second replication bitmap andcreates the duplicate of the second source data volume required in thecurrent replication task, the second storage device 120 may notify thefirst storage device 110 that the second storage device 120 is ready forasynchronous replication.

In step 416, the first storage device 110 creates a duplicate of thefirst source data volume at the current moment. After the first storagedevice 110 determines that the second storage device 120 is ready forasynchronous replication, the first storage device 110 begins to createthe duplicate of the first source data volume at the current moment.This step is similar to step 410, reference may be made to a descriptionon step 410 for details, and details are not described herein again.

In step 418, the first storage device 110 generates a first replicationbitmap according to a differential bitmap of the first source datavolume. Because step 418 is similar to step 412, reference may be madeto a description on step 412 for details. It should be noted that,because in this embodiment of the present disclosure, the first storagedevice 110 and the second storage device 120 keep consistency of storeddata by using a synchronous replication technology, and data stored inthe first source data volume is the same as data stored in the secondsource data volume, the first replication bitmap of the first storagedevice 110 is the same as the second replication bitmap of the secondstorage device 120.

In step 420, the first storage device 110 instructs the second storagedevice 120 to begin to receive a write data command of the host 100. Tosynchronize the first storage device 110 and the second storage device120, after determining that the first storage device 110 and the secondstorage device 120 are already ready for replication in the currentreplication task, the first storage device 110 may instruct the secondstorage device 120 to begin to receive the write data command of thehost 100. This is because the write data command of the host 100received after the devices are ready for replication does not affectexecution of the current replication task.

In step 422, the first storage device 110 begins to receive a write datacommand of the host 100. In step 424, the second storage device 120begins to receive a write data command of the host 100. It isunderstandable that, data written to the first storage device 110 andthe second storage device 120 after the current replication task isstarted will be replicated to the third storage device 130 in a nextreplication task.

It should be noted that, in an actual application, step 404, step 408,step 410, step 416, and step 420 to step 424 are optional. For example,as described above, if data that needs to be replicated by a storagedevice in the current replication task is determined in a manner ofadding a time slice number, when asynchronous replication is started,the first storage device 110 and the second storage device 120 may notstop receiving the write data command of the host 100, or may notnecessarily create a duplicate of a LUN.

In step 426, the first storage device 110 determines first replicationsub-information according to the first replication bitmap and areplication policy. In step 428, the second storage device 120determines the second replication sub-information according to thesecond replication bitmap and the replication policy. In this embodimentof the present disclosure, because the first replication bitmap is thesame as the second replication bitmap, and the preset replication policyis also the same, the first storage device 110 and the second storagedevice 120 may separately determine a replication sub-bitmap. Thereplication sub-bitmap herein may specifically include a firstreplication sub-bitmap determined by the first storage device 110 and asecond replication sub-bitmap determined by the second storage device120. The first replication sub-bitmap is used to identify informationabout data that needs to be replicated by the first storage device 110to the third storage device 130, and the second replication sub-bitmapis used to identify information about data that needs to be replicatedby the second storage device 120 to the third storage device 130. It isunderstandable that, a specific replicate range for the first storagedevice 110 and the second storage device 120 may be specified in thereplication policy. Therefore, the data indicated by the firstreplication sub-bitmap and the data indicated by the second replicationsub-bitmap may not be repeated. Step 426 and step 428 are similar tostep 302, and reference may be made to a relevant description on step302 for details.

In step 430, the first storage device 110 replicates a part ofdifferential data to the third storage device 130 according to the firstreplication sub-bitmap. Specifically, during replication, the firststorage device 110 may replicate a part of the differential dataaccording to a replication rule of the first storage device 110 in onereplication process of the current replication task. For example, aquantity of LBAs of data blocks duplicated once may be set in thereplication rule.

In step 432, the third storage device 130 returns a response for singlereplication success to the first storage device 110, to notify the firststorage device 110 that the data replicated this time has beensuccessfully written to the third storage device 130.

In step 434, the first storage device 110 updates the first replicationsub-bitmap according to the response for single replication successreturned by the third storage device 130. Specifically, when the firstreplication sub-bitmap is being updated, a replication completionidentifier may be marked in a grid, in the first replication sub-bitmap,corresponding to an address of data that has been replicated, or a flagbit in a grid, in the first replication sub-bitmap, corresponding to theaddress of the data that has been replicated may be deleted. Forexample, a flag bit “1” in a grid, in the first replication sub-bitmap,corresponding to the address of the data that has been replicated may bedeleted. It is understandable that, because the first replicationsub-bitmap is a part of the first replication bitmap, updating the firstreplication sub-bitmap is updating the first replication bitmap.

In step 436, the first storage device 110 sends replication progressinformation of the first storage device 110 to the second storage device120. In this embodiment of the present disclosure, each time the firststorage device 110 finishes replicating differential data, the firststorage device 110 needs to send the replication progress information ofthe first storage device 110 to the second storage device 120, so thatthe second storage device 120 may know replication progress of the firststorage device 110. If a fault occurs on the first storage device 110 inthe replication process, the second storage device 120 may take over thereplication task of the first storage device 110 according to thereplication progress information of the first storage device 110. Todistinguish the replication progress information of the first storagedevice 110 from replication progress information of the second storagedevice 120, in this embodiment of the present disclosure, thereplication progress information of the first storage device 110 isreferred to as first replication progress information, and thereplication progress information of the second storage device 120 isreferred to as second replication progress information.

In an actual application, the first storage device 110 may send thefirst replication progress information to the second storage device 120in a form of a replication progress message. A format of the replicationprogress message may be similar to a format of the foregoing replicationtask start message. A message type (for example, opCode) field, a sourcedevice ID (for example, srcAppld) field, and a destination device ID(for example, dstAppld) field may also be included in a header of thereplication progress message. The message type field is used torepresent that a type of the message is replication progressinformation. In addition to carrying a source data volume and adestination data volume in the current replication task, a content part(for example, a data field of the replication progress message) of thereplication progress message further needs to carry current replicationprogress information. For example, a format of the content part of thereplication progress message may be shown as follows:

Byte 0 1 2 3 0 LUN ID 4 ReplicationObject LUN id 8 Address

where for a description on the “LUN ID” field and the “ReplicationObjectLUN id” field, reference may be made to the foregoing description on thereplication task start message, and details are not described hereinagain.

Address: the field may be located after the eighth byte, and is used tocarry the current replication progress information. The replicationprogress information may be one or more logical block addresses (LogicalBlock Address, LBA) or an updated replication bitmap, for example, maybe an updated first replication sub-bitmap. When the replicationprogress information is one LBA, the replication progress informationmay be represented by an LBA of a last piece of data that has beenreplicated. When the replication progress information is multiple LBAs,the replication progress information may be represented by addresses ofall data that has been replicated currently. A specific form of thereplication progress information is not limited herein, as long as thereplication progress information can represent replication progress.

For example, in the first replication progress information sent by thefirst storage device 110 to the second storage device 120, in a headerof the first replication information message, a “source device ID” fieldmay be an IP address of the first storage device 110, and a “destinationdevice ID” field may be an IP address of the second storage device 120.In a content part of the first replication progress information, a “LUNID” field may be an ID of the first source data volume in the firststorage device 110 that stores to-be-replicated data; a“ReplicationObject LUN id” field may be an ID of the destination datavolume in the third storage device 130; and an “Address” field may be anLBA of the last piece of data that has been replicated. It should benoted that, in this embodiment of the present disclosure, because areplication relationship is established in advance among the firstsource data volume in the first storage device 110, the second sourcedata volume in the second storage device 120, and the destination datavolume in the third storage device 130, the second storage device 120may determine, according to the LUN id field in the first replicationprogress information and the preset replication relationship, the secondsource data volume in the second storage device 120 corresponding to thecurrent replication task. In addition, an ID of a LUN corresponding tothe first storage device 110 and an ID of a LUN corresponding to thesecond storage device 120 may be the same or may be different, as longas data volumes that store same data can be determined according to thereplication relationship.

In step 438, the second storage device 120 replicates a part of thedifferential data to the third storage device 130 according to thesecond replication sub-bitmap. In step 440, the third storage device 130returns a response for a single replication success to the secondstorage device 120, to notify the second storage device 120 that thedata replicated this time has been successfully written to the thirdstorage device 130. In step 442, the second storage device 120 updatesthe second replication sub-bitmap according to the response for singlereplication success returned by the third storage device 130. In step444, the second storage device 120 sends the replication progressinformation of the second storage device 120 to the first storage device110.

It should be noted that, one replication process of the second storagedevice 120 in the current replication task is described in step 438 tostep 444, and the process is similar to one replication process,described in step 430 to step 436, of the first storage device 110 inthe current replication task. Therefore, for descriptions on step 438 tostep 444, reference may be made respectively to relevant descriptions onstep 430 to step 436.

In step 446, when the first storage device 110 determines thatreplication is completed according to the first replication sub-bitmap,the first storage device 110 finishes the current replication task. Inthe replication process, the first storage device 110 circularlyexecutes actions in step 430 to step 436, and the first storage device110 may finish this replication task until the first storage device 110determines that the data that needs to be replicated by the firststorage device 110 has been replicated according to the firstreplication sub-bitmap. It is understandable that, if the firstreplication sub-bitmap is determined according to the preset replicationpolicy, and it is set in the preset replication policy that the firststorage device 110 performs replication in a direction from a startaddress of data indicated by the replication bitmap to an end address,and that the second storage device 110 performs replication in adirection from the end address of the data indicated by the replicationbitmap to the start address, the first storage device 110 determines tofinish the current replication task when determining, according to thereplication progress information of the second storage device 120, thatthere is repetition between the data needing to be replicated by thefirst storage device 110 and the data that has been replicated by thesecond storage device 120. It should be noted that, after the currentreplication task is finished, the duplicate, of the first source datavolume in the first storage device 110, created by the first storagedevice 110 in step 416 needs to be deleted.

In step 448, when the second storage device 120 determines thatreplication is completed according to the second replication sub-bitmap,the second storage device 120 finishes the current replication task. Inthe replication process, the second storage device 120 circularlyexecutes actions in step 438 to step 444, and the second storage device120 may finish the current replication task until the second storagedevice 120 determines that the data that needs to be replicated by thesecond storage device 120 has been replicated according to the secondreplication sub-bitmap. It should be noted that, after the currentreplication task is finished, the duplicate, of the second source datavolume in the second storage device 120, created by the second storagedevice 120 in step 410 needs to be deleted.

It should be noted that, in the replication process, the first storagedevice 110 and the second storage device 120 may separately execute thereplication task. In this embodiment of the present disclosure, asequence in which the first storage device 110 executes step 430 to step436 and the second storage device 120 executes step 438 to step 444 isnot limited.

In still another situation, if a fault occurs on the first storagedevice 110 in the replication process, after the second storage device120 determines that a fault occurs on the first storage device 110, thesecond storage device 120 may determine, according to the secondreplication bitmap and replication progress information of the firststorage device 110 that is received last time before the fault occurs onthe first storage device 110, data that has not been replicated by thefirst storage device 110, and replicate, from the second source datavolume to the destination data volume in the third storage device 130,the data that has not been replicated by the first storage device 110.

In an actual application, the second storage device 120 may detect,according a heartbeat between the second storage device 120 and thefirst storage device 110, whether a fault occurs on the first storagedevice 110. For example, when the second storage device 120 does notreceive a heartbeat signal of the first storage device 110 within a settime, the second storage device 120 may determine that a fault occurs onthe first storage device 110. Further, to improve detection accuracy,after determining, according to the heartbeat, that a fault occurs onthe first storage device 110, the second storage device 120 may send aquery request to the third storage device 130, where the query requestis used to query a communication status between the first storage device110 and the third storage device 130, and if a query response returnedby the third storage device 130 reveals that communication between thefirst storage device 110 and the third storage device 130 is alreadyinterrupted, the second storage device 120 may determine that a faultoccurs on the first storage device 110. The second storage device 120may take over the replication task of the first storage device 110, andreplicate, to the third storage device 130, data that has not beenreplicated by the first storage device 110 to the third storage device130.

In this embodiment of the present disclosure, because the second storagedevice 120 may also notify the first storage device 110 of thereplication progress information of the second storage device 120, if afault occurs on the second storage device 120 in the replicationprocess, the first storage device 110 may also take over the replicationtask of the second storage device 120 according to the replicationprogress information of the second storage device 120 and the firstreplication bitmap, and replicate, to the third storage device 130, datathat has not been replicated by the second storage device 120 to thethird storage device 130. The process is similar to the foregoingprocess in which the second storage device 120 takes over thereplication task of the first storage device 110, reference may be madeto the foregoing description for details, and details are not describedherein again.

The method shown in FIG. 4a and FIG. 4b is based on the method shown inFIG. 3. In a replication process, a first storage device 110 and asecond storage device 120 notify each other of replication progress.Therefore, in the replication process, if a fault occurs on the firststorage device 110 or the second storage device 120, one storage deviceon which no fault occurs in a first storage system 33 can continue tocomplete a replication task of a storage device on which a fault occurs.Therefore, even though a fault occurs on one of the storage devices, thereplication task of the first storage system 33 may not be interrupted,thereby further enhancing system stability at the time of improvingreplication efficiency.

FIG. 5 is a flowchart of still another data replication method accordingto an embodiment of the present disclosure. In the method shown in FIG.5, a 3DC disaster recovery system including three storage devices isstill used as an example. The method shown in FIG. 5 is described from aperspective of a second storage system 44 that receives data, where afirst storage system 33 includes a first storage device 110 and a secondstorage device 120, and the second storage system 44 may include thethird storage device 130 in the level-2 site 13 shown in FIG. 1. In thisembodiment of the present disclosure, a description is provided still byusing an example in which a current replication task is a replicationtask among a first source data volume in the first storage device 110, asecond source data volume in the second storage device 120, and adestination data volume in the third storage device 130. The followingdescribes the method shown in FIG. 5 with reference to FIG. 1. As shownin FIG. 5:

In step 500, the third storage device 130 receives replicationinformation sent by the first storage system 33, where the replicationinformation is used to indicate data that needs to be replicated by thefirst storage system 33 to the third storage device 130. In an actualapplication, when an asynchronous replication task between the firststorage system 33 and the second storage system 44 is started, the firststorage device 110 or the second storage device 120 in the first storagesystem 33 may send the replication information to the third storagedevice 130 in the second storage system 44. An example in which thefirst storage device 110 sends the replication information to the thirdstorage device 130 is used below. Specifically, the first storage device110 in the first storage system 33 may determine the replicationinformation according to data stored in the first source data volume.The replication information may be obtained according to differentialdata information of the first source data volume in the first storagedevice 110 when the current replication task begins. After determiningthe replication information in the current replication task, the firststorage device 110 may send the determined replication information tothe third storage device 130. The replication information may berepresented in a form of a replication bitmap, or may be represented inanother structure form such as a tree structure, which is not limitedherein. For a relevant description on the replication information andthe differential data information, reference may be made to adescription on step 300 in FIG. 3, and details are not described hereinagain.

In an actual application, the first storage device 110 may send thereplication information to the third storage device 130 by sending areplication task start message to the third storage device 130. Thereplication task start message carries an identifier of the first sourcedata volume and the replication information that is determined accordingto the data stored in the first source data volume. For a specificdescription on the replication task start message, reference may be madeto a relevant description in the embodiment shown in FIG. 3, or FIG. 4aand FIG. 4 b.

In step 502, the third storage device 130 sends a first acquisitionrequest to the first storage device 110 according to the replicationinformation. The first acquisition request includes information aboutdata that needs to be acquired by the third storage device 130 from thefirst storage device 110. The information about the data included in thefirst acquisition request includes at least an identifier of a datavolume to which the data that needs to be acquired belongs and addressinformation of the data. For example, the information about the dataincluded in the first acquisition request includes at least theidentifier of the first source data volume and an address of the datarequested by using the first acquisition request. The identifier of thedata volume to which the data that needs to be acquired belongs may bean identifier of a LUN, and the address of the data may be an LBA.

In an actual application, the first acquisition request may be in acommand format such as a read command or a replication command, which isnot limited herein. Alternatively, the third storage device 130 mayprepare a storage space according to the received replicationinformation, and then send the first acquisition request to the firststorage device 110, so that data that is sent by the first storagedevice 110 according to the first acquisition request can be receivedand stored in a timely manner.

It should be noted that, in this step, for the command format of thefirst acquisition request, reference may be made to the format of thereplication progress message in step 436, and a message type in thefirst acquisition request needs to specify that the message is anacquisition request. The information, which needs to be carried in thefirst acquisition request, about the requested data may include: anidentifier of the first source data volume in the first storage devicein which the data that needs to be acquired in the current replicationtask is located and an address of the data.

In step 504, the third storage device 130 sends a second acquisitionrequest to the second storage device 120 in the first storage system 33according to the replication information. The second acquisition requestincludes information about data that needs to be acquired by the thirdstorage device 130 from the second storage device 120. Specifically,after receiving the replication task start message sent by the firststorage device 110, the third storage device 130 may determine thesecond source data volume in the second storage device 120 and thedestination data volume in the third storage device 130 according to thepreset identifier of the first source data volume and a presetreplication relationship. Data stored in the second source data volumeis the same as the data stored in the first source data volume, and thedestination data volume is configured to store data received by thethird storage device 130 in the current replication task. Afterdetermining the second source data volume, the third storage device 130may send the second acquisition request to the second storage device120, and the information, which is included in the second acquisitionrequest, about the requested data includes at least an identifier of thesecond source data volume in the second storage device and an address ofthe data requested by using the second acquisition request. For aspecific description on the replication relationship, reference may bemade to a description in the embodiment shown in FIG. 3.

It is understandable that, to prevent data replicated by the firststorage device 110 and the second storage device 120 from beingrepeated, the address of the data in the second acquisition request isdifferent from the address of the data in the first acquisition request.In this manner, the data requested by using the second acquisitionrequest is different from the data requested by using the firstacquisition request. Because step 504 is similar to step 502, for arelevant description on the second acquisition request, reference may bemade to a description on step 502.

It should be noted that, in this embodiment of the present disclosure,there is no sequence for step 502 and step 504, and the third storagedevice 130 may simultaneously send a request separately to the firststorage device 110 and the second storage device 120, so as to acquiredata separately from the first storage device 110 and the second storagedevice 120.

Preferably, to enable the first storage device 110 and the secondstorage device 120 to implement load balance in a replication process,in step 502, when the third storage device 130 sends the firstacquisition request to the first storage device 110, the third storagedevice 130 may determine, according to bandwidth of a link between thethird storage device 130 and the first storage device 110, an amount ofdata to be acquired from the first storage device 110. Specifically,after determining, according to the received replication information andthe bandwidth of the link between the third storage device 130 and thefirst storage device 110, the data that needs to be acquired from thefirst storage device 110, the third storage device 130 may send thefirst acquisition request to the first storage device 110 according tothe determined data that needs to be acquired from the first storagedevice 110. Similarly, in step 504, when the third storage device 130sends the second acquisition request to the second storage device 120,the third storage device 130 may also determine, according to bandwidthof a link between the third storage device 130 and the second storagedevice 120, an amount of requested data from the second storage device120. Specifically, after determining, according to the receivedreplication information and the bandwidth of the link between the thirdstorage device 130 and the second storage device 120, the data thatneeds to be acquired from the second storage device 120, the thirdstorage device 130 may send the second acquisition request to the secondstorage device 120 according to the determined data that needs to beacquired from the second storage device 120.

In step 506, the third storage device 130 receives data that is sent bythe first storage device 110 according to the first acquisition request.Specifically, the first storage device 110 may send corresponding datastored in the first source data volume to the third storage device 130according to the address, of the data that needs to be acquired, carriedin the first acquisition request, to replicate the data to thedestination data volume in the third storage device 130.

In step 508, the third storage device 130 receives data that is sent bythe second storage device 120 according to the second acquisitionrequest. The second storage device 120 may also send corresponding datastored in the second source data volume to the third storage device 130according to the address, of the data that needs to be acquired, carriedin the second acquisition request, to replicate the data to thedestination data volume in the third storage device 130.

In the method shown in FIG. 5, because a third storage device 130 mayreceive replication information, in a current replication task, sent bya first storage device 110, the third storage device 130 maysimultaneously acquire data from the first storage device 110 and asecond storage device 120, thereby improving replication link bandwidthand replication efficiency. Further, the third storage device 130 mayautonomously select ranges of data acquired from the first storagedevice 110 and the second storage device 120 after preparing a storagespace, so that autonomy of the third storage device 130 is stronger, andan operation is more flexible. In addition, when acquisition data fromthe first storage device 110 and the second storage device 120, thethird storage device 130 may determine, according to link bandwidth, anamount of requested data from the first storage device 110 or the secondstorage device 120, so that the first storage device 110 and the secondstorage device 120 can implement load balance in the current replicationtask.

FIG. 6a and FIG. 6b are a signaling diagram of still another datareplication method according to an embodiment of the present disclosure.In the method, how data in a first storage system 33 is replicated to asecond storage system 44 is described still by using the 3DC disasterrecovery system including three storage devices shown in FIG. 1 as anexample. For ease of description, in this embodiment of the presentdisclosure, replication information is represented by a replicationbitmap, and differential data information is represented by adifferential bitmap for description. The following describes FIG. 6a andFIG. 6b with reference to FIG. 1. Specifically, as shown in FIG. 6a andFIG. 6b , the method may include the following steps.

In step 602, a first storage device 110 determines to start anasynchronous replication task. In step 604, the first storage device 110stops receiving a write data command of a host 100, and processes areceived write data command. In step 606, the first storage device 110instructs a second storage device 120 to start asynchronous replication.In step 608, the second storage device 120 stops receiving a write datacommand of the host, and processes a received write data command. Instep 610, the second storage device 120 creates a duplicate of a LUN ata current moment. In step 612, the second storage device 120 notifiesthe first storage device 110 that the second storage device 120 is readyfor asynchronous replication. In step 614, the first storage device 110creates a duplicate of a LUN at the current moment. In step 616, thefirst storage device 110 generates a replication bitmap according to adifferential bitmap.

The foregoing steps are similar to step 402 to step 410 and step 414 tostep 418 in the embodiment shown in FIG. 4a and FIG. 4b , and referencemay be made to descriptions separately on the relevant steps in FIG. 4aand FIG. 4b for details.

In step 618, the first storage device 110 sends the replication bitmapto a third storage device 130. Specifically, when sending thereplication bitmap, the first storage device 110 may send thereplication bitmap to the third storage device 130 still in a manner ofadding the replication bitmap to a content part of a replication taskstart message. For a specific description on the replication task startmessage, reference may be made to a relevant description in theembodiment shown in FIG. 3, FIG. 4a and FIG. 4b , or FIG. 5.

In step 620, the third storage device 130 returns, to the first storagedevice 110, a response for receiving the replication bitmap. In step622, the first storage device 110 deletes a local replication bitmap.Specifically, after the first storage device 110 successfully sends thereplication bitmap to the third storage device 130, the first storagedevice 110 may delete the local replication bitmap to save resources. Itshould be noted that, in the embodiment shown in FIG. 6a and FIG. 6b ,the second storage device 120 may further generate a replication bitmapand then send the replication bitmap to the third storage device 130,which is not limited herein. In the embodiment shown in FIG. 6a and FIG.6b , it is unnecessary for both the first storage device 110 and thesecond storage device 120 to generate a replication bitmap, as long asthe first storage device 110 or the second storage device 120 generatesone replication bitmap.

In step 624, the first storage device 110 instructs the second storagedevice 120 to begin to receive a write data command of the host. In step626, the first storage device 110 begins to receive a write data commandof the host 100. In step 628, the second storage device 120 begins toreceive a write data command of the host 100. Step 624 to step 628 arerespectively similar to step 420 to step 424 in FIG. 4a and FIG. 4b ,and reference may be made to relevant descriptions on step 420 to step424 in FIG. 4a and FIG. 4b for details.

In step 630, the third storage device 130 sends a first acquisitionrequest to the first storage device 110 according to the replicationbitmap. The first acquisition request includes address information ofdata that needs to be acquired by the third storage device 130 from thefirst storage device 110. Step 630 is similar to step 503 in FIG. 5, andreference may be made to a relevant description on step 503 in theembodiment shown in FIG. 5 for details.

In step 632, the third storage device 130 receives data that is sent bythe first storage device 110 according to the first acquisition request.Step 632 is similar to step 506 in FIG. 5, and reference may be made toa relevant description on step 506 in the embodiment shown in FIG. 5 fordetails.

In step 634, the third storage device 130 updates the replication bitmapaccording to the received data. Specifically, the third storage device130 may update the replication bitmap according to the addressinformation of the received data. Specifically, when the replicationbitmap is being updated, a replication completion identifier may bemarked in a grid, in the replication bitmap, corresponding to an addressof data that has been replicated, or a flag bit in a grid, in thereplication bitmap, corresponding to an address of data that has beenreplicated may be deleted. For example, a flag bit “1” in a grid, in thereplication bitmap, corresponding to the address of the data that hasbeen replicated may be deleted. In an actual application, the thirdstorage device 130 may update the replication bitmap each time data isreceived, which is not limited herein.

In step 636, the third storage device 130 sends a second acquisitionrequest to the second storage device 120 according to the replicationbitmap. The second acquisition request includes address information ofdata that needs to be acquired by the third storage device 130 from thesecond storage device 120. It should be noted that, to prevent datareplicated by the first storage device 110 and the second storage device120 from being repeated, the data requested by using the secondacquisition request is different from the data requested by using thefirst acquisition request. This step is similar to step 504, andreference may be made to a relevant description on step 504 for details.

In step 638, the third storage device 130 receives data that is returnedby the second storage device 120 according to the second acquisitionrequest. In step 640, the third storage device 130 updates thereplication bitmap according to the received data sent by the secondstorage device 120. Step 638 to step 640 are respectively similar tostep 632 to step 634, and reference may be made to relevant descriptionson step 632 to step 634 for details.

It should be noted that, in this embodiment of the present disclosure,there is no sequence for step 630 to step 634 and step 636 to step 640,and the third storage device 130 may simultaneously send a requestseparately to the first storage device 110 and the second storage device120, so as to acquire data separately from the first storage device 110and the second storage device 120.

In step 642, if the third storage device 130 determines, according to anupdated replication bitmap, that the current replication task iscomplete, the third storage device 130 finishes the current replicationtask. In an actual application, step 630 to step 640 may be executedcircularly, and each time data is to be acquired, the third storagedevice 130 may send an acquisition request to the first storage device110 or the second storage device 120, and update the replication bitmapaccording to received data. When determining, according to the updatedreplication bitmap, that all data that needs to be replicated has beenreplicated, the third storage device 130 may finish the currentreplication task, and no longer send an acquisition request to the firststorage device 110 or the second storage device 120. It isunderstandable that, after the current replication task is completed,the third storage device 130 may delete the replication bitmap.

In still another situation, if in a replication process, the thirdstorage device 130 determines that a fault occurs on either of the firststorage device 110 and the second storage device 120, the third storagedevice 130 may send an acquisition request to a storage device on whichno fault occurs, to request the storage device, on which no faultoccurs, of the first storage device 110 and the second storage device120 to send, to the third storage device 130, data that has not beenreplicated. It is understandable that, the third storage device 130 maydetermine, according to whether a heartbeat signal of the first storagedevice 110 or the second storage device 120 is received within a settime or according to whether data sent by the first storage device 110or the second storage device 120 is received within a set time, whetherthe first storage device 110 or the second storage device 120 is faulty,and a method for how the third storage device 130 determines whether thefirst storage device 110 or the second storage device 120 is faulty isnot limited herein.

In the embodiment shown in FIG. 6a and FIG. 6b , the third storagedevice 130 may simultaneously acquire to-be-replicated data from thefirst storage device 110 and the second storage device 120 in the firststorage system 33. Therefore, replication link bandwidth is improved. Inaddition, when a fault occurs on one storage device in the first storagesystem 33, the third storage device 130 may continue to acquire datafrom a storage device on which no fault occurs, so that even though afault occurs on one of the storage devices, the replication task of thefirst storage system 33 may not be interrupted, thereby furtherenhancing system stability at the time of improving replicationefficiency.

It should be noted that, in the foregoing embodiments, all descriptionsare provided by using an example in which the first storage device 110and the second storage device 120 keep data consistency by using asynchronous replication technology. In an actual application, the firststorage device 110 and the second storage device 120 may also keepconsistency of stored data by means of asynchronous replication, as longas it is ensured that the first storage device 110 and the secondstorage device 120 in the first storage system 33 store same data whenthe replication task from the first storage system 33 to the secondstorage system 44 is started, which is not limited herein.

FIG. 7 is a schematic structural diagram of a storage system accordingto an embodiment of the present disclosure. The storage system shown inFIG. 7 may be the third storage device 130 in the level-2 site 13 shownin FIG. 1. The following describes the storage system shown in FIG. 7still with reference to FIG. 1. As shown in FIG. 7, the storage system70 may include:

a receiving module 702, configured to receive replication informationsent by a first storage system 33, where the replication information isused to indicate data that needs to be replicated by the first storagesystem 33 to the third storage device 130 in a current replication task,the first storage system 33 includes at least a first storage device 110and a second storage device 120, and the first storage device 110 andthe second storage device 120 store same data; and a sending module 704,configured to send a first acquisition request to the first storagedevice 110 according to the replication information, where the firstacquisition request includes information about data that needs to beacquired by the storage system 70 from the first storage device 110 inthe current replication task. It should be noted that, in a case inwhich the first storage device 110 includes multiple data volumes, theinformation, which is included in the first acquisition request, aboutthe requested data includes at least an identifier of a first sourcedata volume in the first storage device and an address of the datarequested by using the first acquisition request. The first source datavolume stores the data that needs to be replicated by the first storagesystem 33 to the third storage device 130 in the current replicationtask.

The sending module 704 is further configured to send a secondacquisition request to the second storage device 120 according to thereplication information, where the second acquisition request includesinformation about data that needs to be acquired by the storage system70 from the second storage device 120 in the current replication task,and the data requested by using the first acquisition request isdifferent from the data requested by using the second acquisitionrequest. It should be noted that, in a case in which the second storagedevice 120 includes multiple data volumes, the information, which isincluded in the second acquisition request, about the requested dataincludes at least an identifier of a second source data volume in thesecond storage device and an address of the data requested by using thesecond acquisition request. The second source data volume stores thedata that needs to be replicated by the first storage system 33 to thethird storage device 130 in the current replication task. Data stored inthe first source data volume is the same as data stored in the secondsource data volume.

The receiving module 702 is further configured to receive data that issent by the first storage device according to the first acquisitionrequest, and receive data that is sent by the second storage deviceaccording to the second acquisition request.

It is understandable that, the replication information may be areplication bitmap, the replication bitmap may be obtained according toa differential bitmap of the first storage system 33 when the currentreplication task begins, and the differential bitmap is used to recordinformation about data written to the first storage system 33 after aprevious replication task of the current replication task begins andbefore the current replication task begins.

In still another situation, when both the first storage device 110 andthe second storage device 120 include multiple data volumes, thereceiving module 702 is further configured to receive a replication taskstart message sent by the first storage system 33, where the replicationtask start message carries the identifier of the first source datavolume and the replication information that is determined according tothe data stored in the first source data volume. The storage system 70may further include:

a message processing module 708, configured to determine the secondsource data volume in the second storage device 120 and a destinationdata volume in the third storage device 130 according to the identifierof the first source data volume and a preset replication relationship.The replication relationship includes a correspondence among the firstsource data volume, the second source data volume, and the destinationdata volume, and the destination data volume is configured to store thedata received by the storage system in the current replication task.

In still another situation, the storage system 70 may further include:

a determining module 706, configured to determine, according to thereplication information and bandwidth of a link between the storagesystem 70 and the first storage device 110, the data that needs to beacquired from the first storage device 110, and determine, according tothe replication information and bandwidth of a link between the storagesystem 70 and the second storage device 120, the data that needs to beacquired from the second storage device 120.

The sending module 704 may be specifically configured to send the firstacquisition request to the first storage device according to the datathat is determined by the determining module 706 and that needs to beacquired from the first storage device, and send the second acquisitionrequest to the second storage device according to the data that isdetermined by the determining module 706 and that needs to be acquiredfrom the second storage device.

In still another situation, the storage system 70 may further include:

an updating module 710, configured to: in a process of executing thecurrent replication task, update the replication information accordingto the received data.

The storage system 70 provided by this embodiment of the presentdisclosure may execute the data replication methods described in theembodiments of FIG. 5, and FIG. 6a and FIG. 6b , for a detaileddescription of a function of each unit, reference may be made todescriptions in the method embodiments, and details are not describedherein again.

It is understandable that the embodiment shown in FIG. 7 is merelyexemplary. For example, the module division is merely logical functiondivision and may be other division in actual implementation. Forexample, a plurality of modules or components may be combined orintegrated into another device, or some features may be ignored or notperformed. In addition, the displayed or discussed mutual couplings ordirect couplings or communication connections may be implemented byusing some communications interfaces. The indirect couplings orcommunication connections between the modules may be implemented inelectronic, mechanical, or other forms.

Modules described as separate parts may or may not be physicallyseparate, and parts displayed as modules may or may not be physicalunits, may be located in one position, or may be distributed on aplurality of network units. Some or all of the modules may be selectedaccording to actual needs to achieve the objectives of the solutions ofthe embodiments.

An embodiment of the present disclosure further provides a computerprogram product for data processing, including a computer readablestorage medium that stores program code, where an instruction includedin the program code is used to perform the method and process of any oneof the foregoing method embodiments. A person of ordinary skill in theart may understand that, the foregoing storage medium includes variousnon-transitory machine readable media that can store program code, suchas a USB flash drive, a removable hard disk, a magnetic disk, an opticaldisc, a RAM, a SSD, or a non-volatile memory.

It should be noted that the embodiments provided in this application aremerely exemplary. A person skilled in the art may clearly understandthat, for the purpose of convenient and brief description, in theforegoing embodiments, the descriptions of the embodiments have theirrespective focuses. For a part that is not described in detail in anembodiment, reference may be made to related descriptions in otherembodiments. Features disclosed in the embodiments, claims andaccompanying drawings of the present disclosure may exist independentlyor exist as a combination. Features that are described in theembodiments of the present disclosure in a hardware manner may beexecuted by using software, and vice versa, and no limitation is madeherein.

What is claimed is:
 1. A data replication method applied to a firststorage system comprising a first storage device and a second storagedevice, the method comprising: determining replication information, bythe first storage system, wherein the replication information is used toindicate data that needs to be replicated by the first storage system toa second storage system in a current replication task, wherein the firststorage device and the second storage device store same data;determining first replication sub-information and second replicationsub-information according to the replication information, by the firststorage system, wherein the first replication sub-information is used toindicate data that needs to be replicated by the first storage device tothe second storage system in the current replication task, the secondreplication sub-information is used to indicate data that needs to bereplicated by the second storage device to the second storage system inthe current replication task, and the data indicated by the firstreplication sub-information is at least partially different from thedata indicated by the second replication sub-information; replicatingthe data that needs to be replicated by the first storage device to thesecond storage system according to the first replicationsub-information; and replicating the data that needs to be replicated bythe second storage device to the second storage system according to thesecond replication sub-information.
 2. The data replication methodaccording to claim 1, wherein the replicating, by the first storagedevice to the second storage system according to the first replicationsub-information, further comprises: replicating data that needs to bereplicated, by the first storage device to a destination data volume inthe second storage system, according to the first replicationsub-information, wherein the data that needs to be replicated is storedin a first source data volume; and replicating the data that needs to bereplicated, by the second storage device to the destination data volumein the second storage system, according to the second replicationsub-information, wherein the data that needs to be replicated is storedin a second source data volume; and wherein the data stored in the firstsource data volume and the data stored in the second source data volumeare the same.
 3. The data replication method according to claim 2,wherein the determining according to the replication information, by afirst storage system further comprises: sending, by the first storagedevice, a replication task start message to the second storage device,wherein the replication task start message carries an identifier of thefirst source data volume and an identifier of the destination datavolume; determining, by the second storage device, the second sourcedata volume in the second storage device according to the identifier ofthe first source data volume, the identifier of the destination datavolume, and a preset replication relationship, wherein the presetreplication relationship includes a correspondence between the firstsource data volume, the second source data volume, and the destinationdata volume; determining, by the second storage device, the replicationinformation according to the data stored in the determined second sourcedata volume; and determining, by the first storage device, thereplication information according to the data stored in the first sourcedata volume.
 4. The data replication method according to claim 2,wherein the determining according to the replication information, by afirst storage system, further comprises: determining, by the firststorage device, the replication information according to the data storedin the first source data volume; sending, by the first storage device, areplication task start message to the second storage device, wherein thereplication task start message carries an identifier of the first sourcedata volume, an identifier of the destination data volume, and thereplication information; and determining, by the second storage deviceaccording to the identifier of the first source data volume, theidentifier of the destination data volume, and a preset replicationrelationship, the second source data volume corresponding to thereplication information, wherein the preset replication relationshipincludes a correspondence between the first source data volume, thedestination data volume, and the second source data volume.
 5. The datareplication method according to claim 1, wherein the determining firstreplication sub-information and second replication sub-informationaccording to the replication information, by the first storage system,further comprises: determining the first replication sub-information, bythe first storage device, according to the replication information and apreset replication policy; and determining the second replicationsub-information, by the second storage device, according to thereplication information and the preset replication policy.
 6. The datareplication method according to claim 1, wherein the determining firstreplication sub-information and second replication sub-informationaccording to the replication information, by the first storage system,further comprises: receiving a replication negotiation request from thesecond storage device, by the first storage device, wherein thereplication negotiation request includes bandwidth information of a linkbetween the second storage device and the second storage system;determining the first replication sub-information and the secondreplication sub-information, by the first storage device, according tothe bandwidth information of the link; and sending, by the first storagedevice, the second replication sub-information to the second storagedevice.
 7. The data replication method according to claim 1, wherein thereplication information consists of the first replicationsub-information and the second replication sub-information, and themethod further comprises: in a process of executing the currentreplication task, generating, by the first storage device, firstreplication progress information according to data that has beenreplicated; sending, by the first storage device, the first replicationprogress information to the second storage device; when the firststorage device is faulty, determining, by the second storage deviceaccording to the first replication progress information, the secondreplication sub-information, and the replication information, data thathas not been replicated by the first storage device; and replicating, bythe second storage device to the second storage system, the data thathas not been replicated by the first storage device.
 8. The datareplication method according to claim 1, further comprising: in theprocess of executing the current replication task, updating the firstreplication sub-information, by the first storage device, according tothe data that has been replicated; and updating the second replicationsub-information, by the second storage device, according to data thathas been replicated.
 9. A data replication method, wherein the method isapplied to a storage system and comprises: receiving, by a secondstorage system, replication information sent by a first storage system,wherein the replication information is used to indicate data that needsto be replicated by the first storage system to the second storagesystem in a current replication task, the first storage system includesa first storage device and a second storage device, wherein the firststorage device and the second storage device store same data; sending,by the second storage system, a first acquisition request to the firststorage device according to the replication information, wherein thefirst acquisition request includes information about data that needs tobe acquired by the second storage system from the first storage devicein the current replication task; sending, by the second storage system,a second acquisition request to the second storage device according tothe replication information, wherein the second acquisition requestincludes information about data that needs to be acquired by the secondstorage system from the second storage device in the current replicationtask, and wherein the data requested by using the first acquisitionrequest is at least partially different from the data requested by usingthe second acquisition request; receiving, by the second storage system,data that is sent by the first storage device according to the firstacquisition request; and receiving, by the second storage system, datathat is sent by the second storage device according to the secondacquisition request.
 10. The method according to claim 9, wherein theinformation, included in the first acquisition request about therequested data comprises an identifier of a first source data volume inthe first storage device and an address of the data requested by usingthe first acquisition request; and the information, included in thesecond acquisition request about the requested data comprises anidentifier of a second source data volume in the second storage deviceand an address of the data requested by using the second acquisitionrequest, wherein both the first source data volume and the second sourcedata volume store the data that needs to be replicated by the firststorage system to the second storage system in the current replicationtask, and wherein data stored in the first source data volume and thedata stored in the second source data volume are the same.
 11. Themethod according to claim 10, wherein the receiving, by a second storagesystem, replication information sent by a first storage systemcomprises: receiving, by the second storage system, a replication taskstart message sent by the first storage system, wherein the replicationtask start message carries the identifier of the first source datavolume and the replication information that is determined according tothe data stored in the first source data volume; and the method furthercomprises: determining, by the second storage system, the second sourcedata volume in the second storage device and a destination data volumein the second storage system according to the identifier of the firstsource data volume and a preset replication relationship, wherein thepreset replication relationship includes a correspondence between thefirst source data volume, the second source data volume, and thedestination data volume, and wherein the destination data volume isconfigured to store the data received by the second storage system inthe current replication task.
 12. The method according to claim 9,wherein the sending, by the second storage system, a first acquisitionrequest to the first storage device according to the replicationinformation comprises: determining, by the second storage systemaccording to the replication information and bandwidth of a link betweenthe second storage system and the first storage device, the data thatneeds to be acquired from the first storage device; and sending, by thesecond storage system, the first acquisition request to the firststorage device according to the determined data that needs to beacquired from the first storage device; and the sending, by the secondstorage system, a second acquisition request to the second storagedevice according to the replication information comprises: determining,by the second storage system according to the replication informationand bandwidth of a link between the second storage system and the secondstorage device, the data that needs to be acquired from the secondstorage device; and sending, by the second storage system, the secondacquisition request to the second storage device according to thedetermined data that needs to be acquired from the second storagedevice.
 13. The method according to claim 9, further comprising: in aprocess of executing the current replication task, updating, by thesecond storage system, the replication information according to thereceived data.
 14. A storage system, wherein the storage systemcomprises: a first storage device and a second storage device, and thefirst storage device and the second storage device store same data,wherein the storage system is configured to: determine replicationinformation, wherein the replication information is used to indicatedata that needs to be replicated by the storage system to a secondstorage system in a current replication task; the storage system isfurther configured to: determine first replication sub-information and asecond replication sub-information according to the replicationinformation, wherein the first replication sub-information is used toindicate data that needs to be replicated by the first storage device tothe second storage system in the current replication task, the secondreplication sub-information is used to indicate data that needs to bereplicated by the second storage device to the second storage system inthe current replication task, and the data indicated by the firstreplication sub-information is at least partially different from thedata indicated by the second replication sub-information; the firststorage device is configured to replicate the data that needs to bereplicated by the first storage device to the second storage systemaccording to the first replication sub-information; and the secondstorage device is configured to replicate the data that needs to bereplicated by the second storage device to the second storage systemaccording to the second replication sub-information.
 15. The storagesystem according to claim 14, wherein the first storage device isconfigured to replicate the data that needs to be replicated by thefirst storage device to a destination data volume in the second storagesystem according to the first replication sub-information, wherein thedata that need to be replicated is stored in a first source data volume;and the second storage device is configured to replicate the data thatneeds to be replicated by the second storage device to the destinationdata volume in the second storage system according to the secondreplication sub-information, wherein the data that need to be replicatedis stored in a second source data volume, wherein data stored in thefirst source data volume and the data stored in the second source datavolume are the same.
 16. The storage system according to claim 15,wherein the first storage device is further configured to send areplication task start message to the second storage device, wherein thereplication task start message carries an identifier of the first sourcedata volume and an identifier of the destination data volume; the secondstorage device is further configured to: determine the second sourcedata volume in the second storage device according to the identifier ofthe first source data volume, the identifier of the destination datavolume, and a preset replication relationship, determine the replicationinformation according to the data stored in the determined second sourcedata volume, wherein the preset replication relationship includes acorrespondence between the first source data volume, the destinationdata volume, and the second source data volume; and the first storagedevice is further configured to determine the replication informationaccording to the data stored in the first source data volume.
 17. Thestorage system according to claim 15, wherein the first storage deviceis further configured to: determine the replication informationaccording to the data stored in the first source data volume, send areplication task start message to the second storage device, wherein thereplication task start message carries an identifier of the first sourcedata volume, an identifier of the destination data volume, and thereplication information; and the second storage device is furtherconfigured to determine, according to the identifier of the first sourcedata volume, the identifier of the destination data volume, and a presetreplication relationship, the second source data volume corresponding tothe replication information, wherein the replication relationshipincludes a correspondence between the first source data volume, thedestination data volume, and the second source data volume.
 18. Thestorage system according to claim 14, wherein the first storage deviceis further configured to determine the first replication sub-informationaccording to the replication information and a preset replicationpolicy; and the second storage device is further configured to determinethe second replication sub-information according to the replicationinformation and the preset replication policy.
 19. The storage systemaccording to claim 14, wherein the first storage device is furtherconfigured to: receive a replication negotiation request from the secondstorage device, wherein the replication negotiation request includesbandwidth information of a link between the second storage device andthe second storage system; determine the first replicationsub-information and the second replication sub-information according tothe bandwidth information of the link; and send the second replicationsub-information to the second storage device.
 20. The storage systemaccording to claim 14, wherein the replication information consists ofthe first replication sub-information and the second replicationsub-information; the first storage device is further configured to: in aprocess of executing the current replication task, generate firstreplication progress information according to data that has beenreplicated, send the first replication progress information to thesecond storage device; and the second storage device is furtherconfigured to: when the first storage device is faulty, determine datathat has not been replicated by the first storage device, according tothe first replication progress information, the second replicationsub-information, and the replication information, replicate the datathat has not been replicated by the first storage device to the secondstorage system.
 21. The storage system according to claim 14, whereinthe first storage device is further configured to update the firstreplication sub-information according to the data that has beenreplicated; and the second storage device is further configured toupdate the second replication sub-information according to data that hasbeen replicated.
 22. A storage system, comprising a controller and astorage, wherein the storage is configured to store data sent by a firststorage system; and the controller is configured to: receive replicationinformation sent by the first storage system, wherein the replicationinformation is used to indicate data that needs to be replicated by thefirst storage system to the storage system in a current replicationtask, the first storage system includes a first storage device and asecond storage device, the first storage device and the second storagedevice store same data; send a first acquisition request to the firststorage device according to the replication information, wherein thefirst acquisition request includes information about data that needs tobe acquired from the first storage device in the current replicationtask; send a second acquisition request to the second storage deviceaccording to the replication information, wherein the second acquisitionrequest includes information about data that needs to be acquired fromthe second storage device in the current replication task, and whereinthe data requested by using the first acquisition request is at leastpartially different from the data requested by using the secondacquisition request; receive data that is sent by the first storagedevice according to the first acquisition request; and receive data thatis sent by the second storage device according to the second acquisitionrequest.
 23. The storage system according to claim 22, wherein: theinformation, included in the first acquisition request, about therequested data comprises: an identifier of a first source data volume inthe first storage device and an address of the data requested by usingthe first acquisition request; and the information, included in thesecond acquisition request, about the requested data comprises: anidentifier of a second source data volume in the second storage deviceand an address of the data requested by using the second acquisitionrequest, wherein both the first source data volume and the second sourcedata volume store the data that needs to be replicated by the firststorage system to the storage system in the current replication task,and wherein data stored in the first source data volume and the datastored in the second source data volume are the same.
 24. The storagesystem according to claim 23, wherein in being configured to receivereplication information, the controller is configured to: receive areplication task start message sent by the first storage system, whereinthe replication task start message carries the identifier of the firstsource data volume and the replication information that is determinedaccording to the data stored in the first source data volume; thecontroller is further configured to: determine the second source datavolume in the second storage device and a destination data volume in thestorage system according to the identifier of the first source datavolume and a preset replication relationship, wherein the presetreplication relationship includes a correspondence between the firstsource data volume, the second source data volume, and the destinationdata volume, and wherein the destination data volume is configured tostore the data received by the storage system in the current replicationtask.
 25. The storage system according to claim 22, wherein thecontroller is further configured to: determine, according to thereplication information and bandwidth of a link between the storagesystem and the first storage device, the data that needs to be acquiredfrom the first storage device; and determine, according to thereplication information and bandwidth of a link between the storagesystem and the second storage device, the data that needs to be acquiredfrom the second storage device; wherein in being configured to send thefirst acquisition request to the first storage device, the controller isconfigured to: send the first acquisition request to the first storagedevice according to the determined data that needs to be acquired fromthe first storage device; wherein in being configured to send the secondacquisition request to the second storage device, the controller isconfigured to: send the second acquisition request to the second storagedevice according to the determined data that needs to be acquired fromthe second storage device.
 26. The storage system according to claim 22,wherein the controller is further configured to: update the replicationinformation according to the received data in a process of executing thecurrent replication task.